html - XPATH - Extract content between two positions with different indent (python) -


i'm trying data html file via url. here's example:

<html>     ...     <div class="start">          <!-- here.. -->         <p></p>         <p><a href=''></a>         <span></span         <br>         <!-- ..to here -->          <div class="end">             ...         </div>         ...         ...     </div>     ... </html> 

i'm trying the data directly under div class="start", don't know how, since div contains whole page. know, div class="end" comes right after data want. keep in mind don't want text in between, different elements, in case <p> & <span> & <a>. note element types may vary showing in html above.

google gave me different types of (without luck): '//*[preceding-sibling::div[@class="start"] , following-sibling::div[@class="end"]]'

you got close googling. looks want is

//div[@class="start"]/*[following-sibling::div[@class="end"]] 

since <div class="start"> parent (not sibling) of data want select, use div[@class="start"]/* in xpath, instead of *[preceding-sibling::div[@class="start"]].


Comments

Popular posts from this blog

python - TypeError: start must be a integer -

c# - DevExpress RepositoryItemComboBox BackColor property ignored -

django - Creating multiple model instances in DRF3 -