extract problem

Post by jiri » Wed Oct 12, 2005 2:48 pm

I have a problem extracting the name of the lake (Michigan) from
http://www.worldlakes.org/lakedetails.asp?lakeid=8509
It worked with version 3.7 but no longer works in 4.31.
The suggested search string is: <FONT color=#336699 size=4>*
but both the test and the macro fail. Here is the HTML that precedes the content:

<td height="35" valign="top" width="628"><img src="images/lakeprofileheader.gif" width="137" height="26" alt="Lake Profile"></td>
</tr>
<tr>
<td valign="top" width="628">
<p><font size="4" color="#336699">Michigan

Any ideas? Thanks, Jiri

Post by Tech Support » Thu Oct 13, 2005 10:24 am

We confirmed that this is a problem with V4.31. It will be fixed with the next update in about a week. But there is also a workaround:

In the extraction command, replace "#" with "*" and the extraction works:

Change

EXTRACT POS=1 TYPE=TXT ATTR=<FONT<SP>color=#336699<SP>size=4>*

to

EXTRACT POS=1 TYPE=TXT ATTR=<FONT<SP>color=*336699<SP>size=4>*
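
To see why the "*" version still selects the same tag, here is a rough Python sketch of wildcard matching. It is only an illustration and not how iMacros is implemented: the helper names `wildcard_to_regex` and `extract` are hypothetical, the sample tag string is assumed from the suggested search string above, and the sketch does not reproduce the V4.31 problem with "#". It only shows that a "*" in place of "#" absorbs that character while the trailing "*" still captures the text.

```python
import re


def wildcard_to_regex(pattern: str) -> "re.Pattern[str]":
    """Turn an iMacros-style pattern into a regex (illustrative assumption only)."""
    # <SP> is how a space is written inside an iMacros command argument.
    pattern = pattern.replace("<SP>", " ")
    # Escape the literal pieces and let every "*" match any run of characters.
    parts = [re.escape(piece) for piece in pattern.split("*")]
    return re.compile("^" + "(.*)".join(parts) + "$", re.DOTALL)


def extract(pattern: str, text: str):
    """Return what the last "*" matched, or None if the pattern does not match."""
    match = wildcard_to_regex(pattern).match(text)
    if match is None or match.lastindex is None:
        return None
    return match.group(match.lastindex)


# Assumed sample: the tag as shown in the suggested search string, plus its text.
sample = "<FONT color=#336699 size=4>Michigan"
workaround = "<FONT<SP>color=*336699<SP>size=4>*"

print(extract(workaround, sample))
```

In this sketch the first "*" matches the single "#" character and the last "*" matches the lake name, so the workaround pattern is slightly looser than the original but still targets the same element.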

Post by jiri » Thu Oct 13, 2005 1:54 pm

Thank you, workaround works!