Can I extract the Data?

Discussions and Tech Support related to website data extraction, screen scraping and data mining using iMacros.
Forum rules
Before asking a question or reporting an issue:
1. Please review the list of FAQ's.
2. Use the search box (at the top of each forum page) to see if a similar problem or question has already been addressed.
3. Try searching the iMacros Wiki - it contains the complete iMacros reference as well as plenty of samples and tutorials.
4. We can respond much faster to your posts if you include the following information: CLICK HERE FOR IMPORTANT INFORMATION TO INCLUDE IN YOUR POST
Post Reply
BillHamaker
Posts: 13
Joined: Fri Jul 12, 2013 4:54 pm

Can I extract the Data?

Post by BillHamaker » Mon Jul 23, 2018 7:37 pm

I am unable to extract the data from this web site...

https://portal.assessor.lacounty.gov/pa ... 4328032014

I don't know what technology they are using for the web page. The HTML source for the page shows a single line including the text "ngview". Looking up this on the internet it is a directive used by AngularJS but that could just be a coincidence and they could be doing something else.

Is there anyway to extract the data on this website with iMacros?
chivracq
Posts: 8636
Joined: Sat Apr 13, 2013 1:07 pm
Location: Amsterdam (NL)

Re: Can I extract the Data?

Post by chivracq » Mon Jul 23, 2018 9:00 pm

BillHamaker wrote:I am unable to extract the data from this web site...

https://portal.assessor.lacounty.gov/pa ... 4328032014

I don't know what technology they are using for the web page. The HTML source for the page shows a single line including the text "ngview". Looking up this on the internet it is a directive used by AngularJS but that could just be a coincidence and they could be doing something else.

Is there anyway to extract the data on this website with iMacros?
No Pb for me from some quick Check to extract a few Fields on that Page..., looks pretty straightforward to me... :?

Code: Select all

VERSION BUILD=8820413 RECORDER=FX
SET !EXTRACT_TEST_POPUP NO
SET !TIMEOUT_STEP 0
TAB T=1
'URL GOTO=https://portal.assessor.lacounty.gov/parceldetail/4328032014
TAG POS=1 TYPE=B ATTR=TXT:AIN: EXTRACT=TXT
TAG POS=2 TYPE=DIV ATTR=TXT:AIN:<SP>4328-032-014<SP>4 EXTRACT=TXT
TAG POS=1 TYPE=B ATTR=TXT:Situs<SP>Address: EXTRACT=TXT
TAG POS=1 TYPE=ADDRESS ATTR=TXT:Situs<SP>Address:<SP>9504<SP>WILSHIRE<SP>BLVD<SP>BEVERLY* EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Use<SP>Type: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:Commercial EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Parcel<SP>Type: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:Regular<SP>Fee<SP>Parcel EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Tax<SP>Rate<SP>Area: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:02410 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Parcel<SP>Status: EXTRACT=TXT
TAG POS=1 TYPE=B ATTR=TXT:ACTIVE EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Create<SP>Date: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=CLASS:ng-binding&&TXT: EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Delete<SP>Date: EXTRACT=TXT
TAG POS=2 TYPE=DD ATTR=CLASS:ng-binding&&TXT: EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Tax<SP>Status: EXTRACT=TXT
TAG POS=1 TYPE=B ATTR=TXT:CURRENT EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Year<SP>Defaulted: EXTRACT=TXT
TAG POS=3 TYPE=DD ATTR=CLASS:ng-binding&&TXT: EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Exemption: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:None EXTRACT=TXT
TAG POS=1 TYPE=B ATTR=TXT:Building<SP>(0103)<SP>&<SP>Land<SP>Overview EXTRACT=TXT
TAG POS=1 TYPE=DIV ATTR=TXT:Building<SP>(0103)<SP>&<SP>Land<SP>Overview EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Use<SP>Code: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:1810 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Design<SP>Type: EXTRACT=TXT
TAG POS=2 TYPE=DD ATTR=TXT:1810 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Quality<SP>Class: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:AX EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:#<SP>of<SP>Units: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:218 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Beds/Baths: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:0/0 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Building<SP>SqFt: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:247,349 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Year<SP>Built: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:1930 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Effective<SP>Year: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:1949 EXTRACT=TXT
TAG POS=1 TYPE=DT ATTR=TXT:Land<SP>SqFt: EXTRACT=TXT
TAG POS=1 TYPE=DD ATTR=TXT:82,245 EXTRACT=TXT

PROMPT {{!EXTRACT}}
(Tested on iMacros for FF v8.8.2, Pale Moon v26.3.3 (=FF47), Win10_x64.)
- (F)CI(M) = (Full) Config Info (Missing): iMacros + Browser + OS (+ all 3 Versions + 'Free'/'PE').
- I don't even read the Qt if that (required) Info is not mentioned...!
- Script & URL help a lot for more "educated" Help...
Post Reply