DOMDocument - Scraping HTML using DOMDocument + PHP
I am scraping the following HTML:
<div class="venue-event-list " rel="gb"> <div class="tracks-list"> <div class="single-track"> <a href="//livevideo.betfair.com/default.do?mi=119408124" target="_blank" class="live-video-link"><div class="bf-icon-live-video tag-i13n i13n-ltxt-lvid i13n-sec-gb i13n-tab-today" title="watch on betfair live video"></div></a> <div class="info-container"> <span class="track-name"> <a class="tag-i13n i13n-ltxt-meeting i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408124">lingfield</a> </span> <div class="races-list"> <div class="single-race" id="m-1_119408124"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408124" title="5f nursery | 7 runners">14:10</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408128"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408128" title="6f mdn stks | 11 runners">14:40</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408132"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408132" title="7f mdn stks | 6 runners">15:10</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408136"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408136" title="2m hcap | 12 runners">15:40</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408140"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408140" title="1m2f sell stks | 6 runners">16:10</a> </span> 
<span class="separator">|</span> </div> <div class="single-race" id="m-1_119408144"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408144" title="1m3f hcap | 8 runners">16:40</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408148"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408148" title="1m1f hcap | 14 runners">17:10</a> </span> </div> </div> </div> </div> </div> <div class="tracks-list"> <div class="single-track"> <a href="//livevideo.betfair.com/default.do?mi=119408153" target="_blank" class="live-video-link"><div class="bf-icon-live-video tag-i13n i13n-ltxt-lvid i13n-sec-gb i13n-tab-today" title="watch on betfair live video"></div></a> <div class="info-container"> <span class="track-name"> <a class="tag-i13n i13n-ltxt-meeting i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408153">wolverhampton</a> </span> <div class="races-list"> <div class="single-race" id="m-1_119408153"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408153" title="5f mdn stks | 7 runners">14:20</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408157"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408157" title="1m6f hcap | 7 runners">14:50</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408161"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408161" title="1m4f sell stks | 5 runners">15:20</a> </span> <span 
class="separator">|</span> </div> <div class="single-race" id="m-1_119408165"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408165" title="1m1f hcap | 13 runners">15:50</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408169"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408169" title="1m1f hcap | 11 runners">16:20</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408173"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408173" title="1m mdn stks | 11 runners">16:50</a> </span> <span class="separator">|</span> </div> <div class="single-race" id="m-1_119408177"> <span class="race-time link-text"> <a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408177" title="1m hcap | 13 runners">17:20</a> </span> </div> </div> </div> </div> </div>
I have used the following code to pull the race name and the time of the race:
    $url = "";
    $html = file_get_contents($url);
    $dom = new DOMDocument();
    @$dom->loadHTML($html);
    $dom->preserveWhiteSpace = false;
    $xpath = new DOMXPath($dom);
    // pull the individual cards for the day
    // li class="rac-cardsclass="ix ixc"
    $getdropdown = '//div[contains(@class, "tracks-list")]';
    $getdropdown2 = $xpath->query($getdropdown);
    // loop through each individual card
    foreach ($getdropdown2 as $dropresults) {
        echo $dropresults->textContent . "<br />";
    }
What I would like to do is pull the meeting name if the link (shown below) contains "gb" and "today" (both appear within the class text):
> <a class="tag-i13n i13n-ltxt-meeting i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408124">lingfield</a>
So the outcome would be lingfield. If that is true, then pull the time of the race and the market ID from the following:
<a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" href="/exchange/plus/#/horse-racing/market/1.119408124" title="5f nursery | 7 runners">14:10</a>
So the outcome would be:
lingfield 14:10 1.119408124
lingfield 14:40 1.119408128
.............................
wolverhampton 14:20 1.119408153
$xpath->query("a[contains(@class,'gb') and contains(@class,'today')]");
Any help would be appreciated.
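One possible way to get the output described above is sketched below. It is only a sketch under a few assumptions: the function name `extractRaces` is hypothetical, the markup is assumed to look like the sample in the question (one `single-track` div per meeting, a `track-name` span holding the meeting link, and `race-link` anchors for the races), and the market ID is assumed to be the last segment of the `href`. It matches on the full class tokens `i13n-sec-gb` and `i13n-tab-today` rather than the bare substrings "gb" and "today", which is a little stricter than the query in the question. Note that the XPath starts with `//` or `.//`; a bare `a[...]` evaluated from the document root would not reach nested links.

```php
<?php
// Hypothetical helper (not from the original post): returns one row per race
// as [meeting name, race time, market ID].
function extractRaces(string $html): array
{
    $dom = new DOMDocument();
    libxml_use_internal_errors(true); // collect parse warnings instead of printing them
    $dom->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($dom);
    $rows = [];

    // Assumption: each meeting lives in its own "single-track" div.
    $tracks = $xpath->query('//div[contains(@class, "single-track")]');
    foreach ($tracks as $track) {
        // Meeting link whose class carries both the "gb" and "today" markers.
        $meeting = $xpath->query(
            './/span[contains(@class, "track-name")]'
            . '/a[contains(@class, "i13n-sec-gb") and contains(@class, "i13n-tab-today")]',
            $track
        )->item(0);
        if ($meeting === null) {
            continue; // not a GB meeting for today, skip its races
        }
        $name = trim($meeting->textContent);

        // Race links inside this meeting only (note the leading ".").
        $races = $xpath->query('.//a[contains(@class, "race-link")]', $track);
        foreach ($races as $race) {
            // Assumption: the market ID is the last path segment of the href.
            $marketId = basename($race->getAttribute('href'));
            $rows[] = [$name, trim($race->textContent), $marketId];
        }
    }
    return $rows;
}

// Usage with a cut-down copy of the markup from the question.
$sample = '<div class="single-track"><span class="track-name">'
    . '<a class="tag-i13n i13n-ltxt-meeting i13n-sec-gb i13n-tab-today" '
    . 'href="/exchange/plus/#/horse-racing/market/1.119408124">lingfield</a></span>'
    . '<div class="races-list">'
    . '<a class="race-link tag-i13n i13n-ltxt-race i13n-sec-gb i13n-tab-today" '
    . 'href="/exchange/plus/#/horse-racing/market/1.119408124" '
    . 'title="5f nursery | 7 runners">14:10</a></div></div>';

foreach (extractRaces($sample) as [$name, $time, $marketId]) {
    echo "$name $time $marketId\n";
}
// prints: lingfield 14:10 1.119408124
```

Passing `$track` as the second argument to `query()` keeps each race tied to its own meeting, so the meeting name can be repeated on every row as in the desired output.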