EXTRACT <TD> WhitOut Class | Inside <tbody>

Discussions and Tech Support related to website data extraction, screen scraping and data mining using iMacros.
Forum rules
iMacros EOL - Attention!

The renewal maintenance has officially ended for Progress iMacros effective November 20, 2023 and all versions of iMacros are now considered EOL (End-of-Life). The iMacros products will no longer be supported by Progress (aside from customer license issues), and these forums will also no longer be moderated from the Progress side.

Thank you again for your business and support.

Sincerely,
The Progress Team

Before asking a question or reporting an issue:
1. Please review the list of FAQ's.
2. Use the search box (at the top of each forum page) to see if a similar problem or question has already been addressed.
3. Try searching the iMacros Wiki - it contains the complete iMacros reference as well as plenty of samples and tutorials.
4. We can respond much faster to your posts if you include the following information: CLICK HERE FOR IMPORTANT INFORMATION TO INCLUDE IN YOUR POST
Post Reply
p0st3r
Posts: 6
Joined: Mon Aug 13, 2018 3:02 am

EXTRACT <TD> WhitOut Class | Inside <tbody>

Post by p0st3r » Mon Aug 13, 2018 3:10 am

Updated: I fixit error, thank for seeing post.
Hi everyone.
I have a problem, extract data from website to CSV file but in the website all data wanto get no have Class dev, all code inside <td>
I have a code in the below, please see and help me.

Thank so much.

Code: Select all

<tbody><tr>
					<td><b>Name: </b></td>
					<td>[b]ratedsegment[/b]</td>
				</tr>
				<tr>
					<td><b>Posts: </b></td>
					<td>0</td>
				</tr><tr>
				</tr><tr>
					<td><b>Position: </b></td>
					<td>Brand new</td>
				</tr>
				<tr>
					<td><b>Date Registered: </b></td>
					<td>[b]August 12, 2018, 10:56:23 AM[/b]</td>
				</tr><tr>
					<td><b>Last Active: </b></td>
					<td><b>Today</b> at 02:57:47 AM</td>
				</tr>
				<tr>
					<td colspan="2"><hr class="hrcolor" width="100%" size="1"></td>
				</tr><tr>
					<td><b>ICQ:</b></td>
					<td></td>
				</tr><tr>
					<td><b>AIM: </b></td>
					<td></td>
				</tr><tr>
					<td><b>MSN: </b></td>
					<td></td>
				</tr><tr>
					<td><b>YIM: </b></td>
					<td></td>
				</tr><tr>
					<td><b>Email: </b></td>
					<td>
						<i><a href="mailto:cfeuer@chiuyu.org">[b]cfeuer@chiuyu.org[/b]</a></i>
					</td>
				</tr><tr>
					<td><b>Website: </b></td>
					<td><a href=""></a></td>
				</tr><tr>
					<td colspan="2"><hr class="hrcolor" width="100%" size="1"></td>
				</tr><tr>
					<td><b>Gender: </b></td>
					<td></td>
				</tr><tr>
					<td><b>Age:</b></td>
					<td>N/A</td>
				</tr><tr>
					<td><b>Location:</b></td>
					<td></td>
				</tr><tr>
					<td><b>Local Time:</b></td>
					<td>August 13, 2018, 02:57:47 AM</td>
				</tr><tr>
					<td colspan="2"><hr class="hrcolor" width="100%" size="1"></td>
				</tr>
				<tr>
					<td colspan="2" height="25">
						<table style="table-layout: fixed;" width="100%" cellspacing="0" cellpadding="0" border="0">
							<tbody><tr>
								<td style="padding-bottom: 0.5ex;"><b>Signature:</b></td>
							</tr><tr>
								<td colspan="2" class="smalltext" width="100%"><div class="signature"></div></td>
							</tr>
						</tbody></table>
					</td>
				</tr>
			</tbody>
I want to get data i choose BOLD in the code.
chivracq
Posts: 10301
Joined: Sat Apr 13, 2013 1:07 pm
Location: Amsterdam (NL)

Re: EXTRACT <TD> WhitOut Class | Inside <tbody>

Post by chivracq » Mon Aug 13, 2018 12:16 pm

p0st3r wrote:Hi everyone.
I have a problem, extract data from website to CSV file but in the website all data wanto get no have Class dev, all code inside <td>
I have a code in the below, please see and help me.

Thank so much.

Code: Select all

<tbody><tr>
					<td><b>Name: </b></td>
					<td>[b]ratedsegment[/b]</td>
				</tr>
				<tr>
					<td><b>Posts: </b></td>
					<td>0</td>
				</tr><tr>
				</tr><tr>
					<td><b>Position: </b></td>
					<td>Brand new</td>
				</tr>
				<tr>
					<td><b>Date Registered: </b></td>
					<td>[b]August 12, 2018, 10:56:23 AM[/b]</td>
				</tr><tr>
					<td><b>Last Active: </b></td>
					<td><b>Today</b> at 02:57:47 AM</td>
				</tr>
				<tr>
					<td colspan="2"><hr class="hrcolor" width="100%" size="1"></td>
				</tr><tr>
					<td><b>ICQ:</b></td>
					<td></td>
				</tr><tr>
					<td><b>AIM: </b></td>
					<td></td>
				</tr><tr>
					<td><b>MSN: </b></td>
					<td></td>
				</tr><tr>
					<td><b>YIM: </b></td>
					<td></td>
				</tr><tr>
					<td><b>Email: </b></td>
					<td>
						<i><a href="mailto:cfeuer@chiuyu.org">[b]cfeuer@chiuyu.org[/b]</a></i>
					</td>
				</tr><tr>
					<td><b>Website: </b></td>
					<td><a href=""></a></td>
				</tr><tr>
					<td colspan="2"><hr class="hrcolor" width="100%" size="1"></td>
				</tr><tr>
					<td><b>Gender: </b></td>
					<td></td>
				</tr><tr>
					<td><b>Age:</b></td>
					<td>N/A</td>
				</tr><tr>
					<td><b>Location:</b></td>
					<td></td>
				</tr><tr>
					<td><b>Local Time:</b></td>
					<td>August 13, 2018, 02:57:47 AM</td>
				</tr><tr>
					<td colspan="2"><hr class="hrcolor" width="100%" size="1"></td>
				</tr>
				<tr>
					<td colspan="2" height="25">
						<table style="table-layout: fixed;" width="100%" cellspacing="0" cellpadding="0" border="0">
							<tbody><tr>
								<td style="padding-bottom: 0.5ex;"><b>Signature:</b></td>
							</tr><tr>
								<td colspan="2" class="smalltext" width="100%"><div class="signature"></div></td>
							</tr>
						</tbody></table>
					</td>
				</tr>
			</tbody>
I want to get data i choose BOLD in the code.
CIM...! :mrgreen: (Read my Sig...)

Same Answer more or less I would think from a quick look, like in this similar Thread...:
- Re: extracting data same web but different pages
=> Either extract the whole Table as it looks to me like you want to extract the whole Content..., and which will keep the "original" Format..., or use 'Relative Positioning' with 'R1' in your Case using each "Label" as Anchor..., if you want to "omit" a few Fields and/or prefer to "reorganize" the Data a bit differently than the original Format...

This is a very "typical" Extraction Scenario, no Difficulty at all... What is your "Problem"...? (No Script posted...) :?
- (F)CI(M) = (Full) Config Info (Missing): iMacros + Browser + OS (+ all 3 Versions + 'Free'/'PE'/'Trial').
- FCI not mentioned: I don't even read the Qt...! (or only to catch Spam!)
- Script & URL help a lot for more "educated" Help...
chivracq
Posts: 10301
Joined: Sat Apr 13, 2013 1:07 pm
Location: Amsterdam (NL)

Re: EXTRACT <TD> WhitOut Class | Inside <tbody>

Post by chivracq » Mon Aug 13, 2018 12:23 pm

p0st3r wrote:Updated: I fixit error, thank for seeing post.
Oh...!, then explain a bit what was the "Error" or "Problem" and how you "fixed it"...? And share your Solution... 8)
- (F)CI(M) = (Full) Config Info (Missing): iMacros + Browser + OS (+ all 3 Versions + 'Free'/'PE'/'Trial').
- FCI not mentioned: I don't even read the Qt...! (or only to catch Spam!)
- Script & URL help a lot for more "educated" Help...
Post Reply