• You are not logged in.

#1 Feb. 11, 2014 06:40:22

jonbod
Registered: 2014-02-11
Posts: 1
Reputation: +  0  -
Profile   Send e-mail  

"extract multiple sets of data" problem

i want to extract multiple sets of data - but i also want to capture the breadcrumb at the top of the page and combine it with each set of data.

i want to scrape this page.

http://www.tesco.com/groceries/product/browse/default.aspx?N=4294793612&Ne=4294793660&lvl=3

i have been able to do a scrape by grouping all the data on the page. but i want to link the results for each record with the “breadcrumb” at the top of the page

 Groceries
Fresh Food
Fresh Vegetables
so that i can catagorise each record.  


what i'm getting at the moment is this:

Tesco Baby Carrots 200g,  £1.60,  (£8.00/kg)
  
what i want is

Groceries, Fresh Food,  Fresh Vegetables, Tesco Baby Carrots 200g,  £1.60,  (£8.00/kg)

how do i do that?

Offline

#2 Feb. 12, 2014 20:13:10

admin
Registered: 2012-03-15
Posts: 289
Reputation: +  1  -
Profile   Send e-mail  

"extract multiple sets of data" problem

Just make a “capture” to scrape “breadcrumb”, though “breadcrumb” is not in groups, you can still capture it.

Offline

Board footer

Moderator control

Powered by DjangoBB