Failing to extract HREF. Please help.

bugmenot666
Posts: 39
Joined: Wed Apr 08, 2009 2:01 pm

Post by bugmenot666 » Mon Sep 27, 2010 1:36 am

Hi Tom, Hannes, everybody else.

I want to extract some href links from a dating site. They look something like this:

<a href="profil.php?id=102738" title="Succubus">image random text</a>
<a href="profil.php?id=729839" title="lonelygirl">image random text</a>

I wish to extract the number after id= in each case. There are about 20 of these links on each page.

I tried the extraction wizard, but that gives me errors any way I try it, and I'm too thick to work around it.

Grateful for any help you can bestow on this.
bugmenot666
Posts: 39
Joined: Wed Apr 08, 2009 2:01 pm

Re: Failing to extract HREF. Please help.

Post by bugmenot666 » Mon Sep 27, 2010 1:49 am

Managed to solve it somehow.

These were the settings used:

HREF in the wizard
A in the Type field
Then this as the attr:
TXT:*&&TITLE:*&&HREF:*

Good night
matts98
Posts: 13
Joined: Fri Sep 03, 2010 6:24 am

Re: Failing to extract HREF. Please help.

Post by matts98 » Mon Sep 27, 2010 9:37 am

Hello there,

You can try this code:

TAG POS=1 TYPE=A ATTR=TXT:*&&TITLE:*&&HREF:* EXTRACT=HREF
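
That line only grabs the first link and returns the whole URL. To walk through all of the roughly 20 links and keep just the number after id=, something like the rough sketch below might work. It assumes the macro is run with the Play (Loop) button, that id is the last parameter in each URL, and that a tighter HREF pattern is acceptable; the file name ids.csv is made up for illustration. If a position has no matching link, the extract will not contain a usable id, so set the loop count to the number of links on the page.

```
' Run with Play (Loop); set Max to the number of links on the page (about 20).
SET !EXTRACT_TEST_POPUP NO
' {{!LOOP}} selects the 1st, 2nd, 3rd ... matching link on each pass.
TAG POS={{!LOOP}} TYPE=A ATTR=HREF:*profil.php*id=* EXTRACT=HREF
' Keep only the text after "id=" (assumes nothing follows the id in the URL).
SET !VAR1 EVAL("var s=\"{{!EXTRACT}}\"; s.split(\"id=\")[1];")
' Swap the full URL in the extract buffer for just the number.
SET !EXTRACT NULL
ADD !EXTRACT {{!VAR1}}
' ids.csv is a hypothetical name; FOLDER=* means the default iMacros download folder.
SAVEAS TYPE=EXTRACT FOLDER=* FILE=ids.csv
```

Each pass of the loop should append one line to the CSV file. Matching on the HREF pattern instead of TXT:*&&TITLE:* keeps other links on the page from being counted in POS.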
Hope it helps :wink: