Jump to content

XPath variable help - to get DOM-Document up and running


dilbertone

Recommended Posts

am trying to scrape the datas from a webpage, but I get need to get all the data in the

 

http://www.aktive-buergerschaft.de/buergerstiftungen/unsere_leistungen/buergerstiftungsfinder

 

see this link - follow it and get full insights

 

include 'simple_html_dom.php';

$html1 = file_get_html('http://www.aktive-buergerschaft.de/buergerstiftungen/unsere_leistungen/buergerstiftungsfinder

 

');

 

$info1 = $html1->find('b[class=[what to enter herer ]',0);

 

well - what is wanted: i need to have all the data out of this site;:

 

see this link - follow it and get full insights

 

Bürgerstiftung Lebensraum Aachen

    rechtsfähige Stiftung des bürgerlichen Rechts

    Ansprechpartner: Hubert Schramm

    Alexanderstr. 69/ 71

    52062 Aachen

    Telefon: 0241 - 4500130

    Telefax: 0241 - 4500131

    Email: info@buergerstiftung-aachen.de

    www.buergerstiftung-aachen.de

    >> Weitere Details zu dieser Stiftung

 

Bürgerstiftung Achim

    rechtsfähige Stiftung des bürgerlichen Rechts

    Ansprechpartner: Helga Kühn

    Rotkehlchenstr. 72

    28832 Achim

    Telefon: 04202-84981

    Telefax: 04202-955210

    Email: info@buergerstiftung-achim.de

    www.buergerstiftung-achim.de

    >> Weitere Details zu dieser Stiftung

 

well - what is needed - i need to have the datas that are "behind" the link - is there any way to do this with a ease and understandable parser - one that can be understood and wrote by a newbie!?

 

that would be more than great

 

Well one word regardng Regex: I have not tooo much experience - but i guess DOM-Document is the smarter way here...

Link to comment
Share on other sites

I suggest hiring a programmer to do this for you, as you've been given solutions before and still can't figure it out.

 

It's not easy to scrape a webpage, as there are many variables to deal with. On top of your own code, you have to interpret someone else's markup. This isn't a task well suited for a beginner at PHP.

 

IMO DOM classes are harder than RegEx if all you want is statically formatted data scraping.

 

If you take the time to learn RegEx, you'll find your solution, and have a tool in your belt that is nearly irreplaceable.

 

http://www.google.com/search?q=learn+regex

Link to comment
Share on other sites

This thread is more than a year old. Please don't revive it unless you have something important to add.

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.