extract problem

Discussions and Tech Support related to website data extraction, screen scraping and data mining using iMacros.

Moderators: Community Moderators, iMacros Moderators

Forum rules
Before asking a question or reporting an issue:
1. Please review the list of FAQ's.
2. Use the Google search box (at the top of each forum page) to see if a similar problem or question has already been addressed. This will search the entire contents of the forums as well as the iMacros Wiki.
3. We can respond much faster to your posts if you include the following information:

CLICK HERE FOR IMPORTANT INFORMATION TO INCLUDE IN YOUR POST

Answering your own posts (e.g. attempting to "bump" your topic) drops your topic from the list of unanswered threads, so it may actually receive less views.

extract problem

by jiri on Wed Oct 12, 2005 7:48 am

i have problem to extract name of lake (Michigan) in
http://www.worldlakes.org/lakedetails.asp?lakeid=8509
it worked with version 3.7 but not now in 4.31.
Suggested search string is: <FONT color=#336699 size=4>*
but test and macro fail. More html before content:

<td height="35" valign="top" width="628"><img src="images/lakeprofileheader.gif" width="137" height="26" alt="Lake Profile"></td>
</tr>
<tr>
<td valign="top" width="628">
<p><font size="4" color="#336699">Michigan

Any idea? Thanks Jiri
jiri
 

by Tech Support on Thu Oct 13, 2005 3:24 am

We confirmed that this is a problem with V4.31. It will be fixed with the next update in about a week. But there is also a workaround:

In the extraction command, replace "#" with "*" and the extraction works:

Change

EXTRACT POS=1 TYPE=TXT ATTR=<FONT<SP>color=#336699<SP>size=4>*

to

EXTRACT POS=1 TYPE=TXT ATTR=<FONT<SP>color=*336699<SP>size=4>*
User avatar
Tech Support
 
Posts: 5003
Joined: Tue Sep 20, 2005 12:25 pm

Thank you!

by jiri on Thu Oct 13, 2005 6:54 am

Thank you, workaround works!
jiri
 


Return to Data Extraction and Web Screen Scraping

Who is online

Users browsing this forum: No registered users and 5 guests

cron
-->