Extract email address from JavaScript in HTML source using Python

Steve Hayes hayesstw at telkomsa.net
Sat May 23 21:04:36 EDT 2015


On Sat, 23 May 2015 19:01:55 +1000, Chris Angelico <rosuav at gmail.com>
wrote:

>On Sat, May 23, 2015 at 4:46 PM, savitha devi <savithad8 at gmail.com> wrote:
>> I am developing a web scraper using HTMLParser. I need to extract a
>> text/email address from JavaScript within the HTML code. I am a beginner
>> in Python and totally lost here. I need some help with this. The
>> JavaScript code is as below:
>>
>> <script type='text/javascript'>
>>  //<!--
>>  document.getElementById('cloak48218').innerHTML = '';
>>  var prefix = 'ma' + 'il' + 'to';
>>  var path = 'hr' + 'ef' + '=';
>>  var addy48218 = 'info' + '@';
>>  addy48218 = addy48218 + 'tsv-neuried' + '.' + 'de';
>>  document.getElementById('cloak48218').innerHTML += '<a ' + path + '\'' + prefix + ':' + addy48218 + '\'>' + addy48218+'<\/a>';
>>  //-->
>
>This obfuscation is deliberately done to prevent scripted usage. What
>exactly do you need this for?

To sell addresses to spammers, of course. 


-- 
Terms and conditions apply. 

Steve Hayes
hayesmstw at hotmail.com


