Search found 29 matches

by Chilly_Bang
Mon Jun 24, 2019 1:45 pm
Forum: Data Extraction and Web Screen Scraping
Topic: Scrape with smart XPath
Replies: 3
Views: 6413

Re: Scrape with smart XPath

FCI in signature is up to date. 9.0.3. is latest stable version i use and since some years driving good with it - i mean, have no causes to complain, even as with FF45... but 'EXTRACT' always only extracts 1 Element at the time, Aha, that is my problem, not the XPath...! Good to know - every day som...
by Chilly_Bang
Sun Jun 23, 2019 7:58 pm
Forum: Data Extraction and Web Screen Scraping
Topic: Scrape with smart XPath
Replies: 3
Views: 6413

Scrape with smart XPath

Usually i scrape with a tool, which understands "smart" XPath - means, if i set scraper to scrape XPath //div[@class='example-class'] , so i get content from all occurences of this class. I really need this, because the site i scrape has different amount of this class occurences - from zer...
by Chilly_Bang
Fri May 17, 2019 1:52 pm
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

Oh-oh, thecoder2012 wasn't there yet... :( And how about you, @chivracq - do the mood and social life allow to look a bit deeper there?
by Chilly_Bang
Fri May 10, 2019 10:02 am
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

it plagues me while my whole life in it - can't get working code parts work together.
But sure, your 2cts brought substantial light into thing - i haven't had it on scope, there are images, which come from different sources.
by Chilly_Bang
Fri May 10, 2019 8:32 am
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

if my last Reply was a bit "useful"
Surely, i was coming to idea (how) to fetch background images after you pointed me to.

This script is very simply to check - throw it into console and you'll see nice output.
by Chilly_Bang
Thu May 09, 2019 3:39 pm
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

With the following function i successfully get all background images and their width and height into console: function getBgImgs (doc) { const srcChecker = /url\(\s*?['"]?\s*?(\S+?)\s*?["']?\s*?\)/i return Array.from( Array.from(doc.querySelectorAll('*')) .reduce((collection, node) => { le...
by Chilly_Bang
Thu May 09, 2019 2:04 pm
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

On extensively using this imacros i even realized, that some SVG images aren't scraped. Try the imacros code on the page "https:// www. flyer alarm.com/at/callback" (delete backspaces from the middle of url) - undependently of waiting time and loading status there are only first four image...
by Chilly_Bang
Mon May 06, 2019 11:33 am
Forum: iMacros for Firefox
Topic: Two loops in one imacros
Replies: 5
Views: 5719

Re: Two loops in one imacros

you only need to add another 'for' Loop, or a 'while' or 'until' "Outer" Loop around your existing 'for' "Inner" Loop I'm afraid, what looks as clarity for you, is for me pretty foggy... Germans say "i'm on the end with my Latin":) I tested some kinds of notation - the...
by Chilly_Bang
Sat May 04, 2019 5:27 pm
Forum: iMacros for Firefox
Topic: Two loops in one imacros
Replies: 5
Views: 5719

Re: Two loops in one imacros

as I guess the Nb of Links on each Page will probably be variable
Correct.
to hard-code 7 or 8 or even 10 times ... But that Method can be a bit "slow"
Absolutely my thought. There could be more then 100 links to extract from one url, so hardcode is not a good idea...
by Chilly_Bang
Sat May 04, 2019 3:30 pm
Forum: iMacros for Firefox
Topic: Two loops in one imacros
Replies: 5
Views: 5719

Two loops in one imacros

i need to loop through urls, like URL GOTO=https://example.com/page/{{!LOOP}} which is always working. An then i want to loop through all links on every looped url and save them, like with TAG POS={{!LOOP}} TYPE=A ATTR=CLASS:list EXTRACT=HREF which isn't working - this extracts only the first occure...
by Chilly_Bang
Sat May 04, 2019 3:28 pm
Forum: iMacros for Firefox
Topic: Extract urls from certain domain
Replies: 6
Views: 5532

Re: Extract urls from certain domain

Thanks to all guys! While we were talking here, the source page was updated and inks got titles !!! So i got it with

Code: Select all

TYPE=A ATTR=TITLE:link<SP>title EXTRACT=HREF
by Chilly_Bang
Mon Apr 29, 2019 9:23 pm
Forum: iMacros for Firefox
Topic: Extract urls from certain domain
Replies: 6
Views: 5532

Extract urls from certain domain

Hi I'm on: Win7x64 FF52 iMacros: BUILD=8881205 I'm trying to do following: there are bunch of paginated pages - i want to extract from the source code of every page urls, which are from certain domain (not from other domains, which are existing too). Throgh the bunch of page i go easy with URL GOTO ...
by Chilly_Bang
Wed May 16, 2018 8:19 am
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

Well, for those who needs quick and dirty solution; take Notepad++ and
- Search for string,
- bookmark lines with matches,
- delete bookmarked lines-

For other who is looking for this updated solution - hope for TheCoder2012 will coming :)
by Chilly_Bang
Tue May 15, 2018 9:06 am
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

Hi guys, @thecoder2012 & chivracq ! I'm now running this imacro for purposes i've planned - it rocks, but... it is a bit more detailed as i can handle its results. The thing is: it gets even images and image info, which are not "normal" images, but are as data:image...base64 implemente...
by Chilly_Bang
Tue Apr 10, 2018 9:23 am
Forum: Data Extraction and Web Screen Scraping
Topic: Get image info
Replies: 28
Views: 35631

Re: Get image info

Whoohaa, i see the light! The whole working code is: VERSION BUILD=8881205 RECORDER=FX SET !TIMEOUT_STEP 0 SET !ERRORIGNORE YES TAB T=1 SET !DATASOURCE pages.csv SET !DATASOURCE_COLUMNS 1 SET !LOOP 1 SET !DATASOURCE_LINE {{!LOOP}} URL GOTO={{!COL1}} URL GOTO=javascript:(function(){let<SP>di<SP>=<SP>...