Discussions and Tech Support related to website data extraction, screen scraping and data mining using iMacros.
Forum rules
iMacros EOL - Attention!
The renewal maintenance has officially ended for Progress iMacros effective
November 20, 2023 and all versions of iMacros are now considered EOL (End-of-Life). The iMacros products will no longer be supported by Progress (aside from customer license issues), and these forums will also no longer be moderated from the Progress side.
Thank you again for your business and support.
Sincerely,
The Progress Team
Before asking a question or reporting an issue:
1. Please review the list of
FAQ's.
2. Use the search box (at the top of each forum page) to see if a similar problem or question has already been addressed.
3. Try searching the
iMacros Wiki - it contains the complete iMacros reference as well as plenty of samples and tutorials.
4. We can respond much faster to your posts if you include the following information:
CLICK HERE FOR IMPORTANT INFORMATION TO INCLUDE IN YOUR POST
-
newmember
- Posts: 20
- Joined: Mon Mar 17, 2014 7:01 am
Post
by newmember » Thu Mar 24, 2016 10:05 am
Hello,
Code: Select all
URL GOTO=https://chinesepod.com/tools/glossary/entry/ugly
TAG POS=1 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=1 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
TAG POS=2 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=2 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
TAG POS=3 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=3 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
SAVEAS TYPE=EXTRACT FOLDER=E:\ FILE=pinyin.csv
I use this code to extract text and URL but it always extract wrong URL.
So the text doesn't match the URL. How to fix it ? Thank you very much
-
andyh2003
- Posts: 9
- Joined: Thu Sep 17, 2015 9:26 am
Post
by andyh2003 » Thu Mar 24, 2016 10:20 am
try changing the TAG position by 1 or 2...
TAG POS=1 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=2 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
or
TAG POS=1 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=3 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
-
chivracq
- Posts: 10301
- Joined: Sat Apr 13, 2013 1:07 pm
- Location: Amsterdam (NL)
Post
by chivracq » Thu Mar 24, 2016 1:08 pm
newmember wrote:Hello,
Code: Select all
URL GOTO=https://chinesepod.com/tools/glossary/entry/ugly
TAG POS=1 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=1 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
TAG POS=2 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=2 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
TAG POS=3 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=3 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
SAVEAS TYPE=EXTRACT FOLDER=E:\ FILE=pinyin.csv
I use this code to extract text and URL but it always extract wrong URL.
So the text doesn't match the URL. How to fix it ? Thank you very much
CIM...!
(Wondering btw if you are still using Pale Moon (like me)...)
andyh2003 wrote:try changing the TAG position by 1 or 2...
Code: Select all
TAG POS=1 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=2 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
or
Code: Select all
TAG POS=1 TYPE=TD ATTR=TXT:* EXTRACT=TXT
TAG POS=3 TYPE=A ATTR=TXT:Download&&HREF:* EXTRACT=HREF
newmember wrote:Yes. It worked
Yep, but using Relative Positioning would probably be more reliable.
If any HTML Element is added on the Page with a Link, you will again extract the wrong URL...
- (F)CI(M) = (Full) Config Info (Missing): iMacros + Browser + OS (+ all 3 Versions + 'Free'/'PE'/'Trial').
- FCI not mentioned: I don't even read the Qt...! (or only to catch Spam!)
- Script & URL help a lot for more "educated" Help...