php - Extracting specific <a href> URLs out of the document


I think this should be elementary, but I still can't get my head around it. Suppose I have a fair number of HTML documents and I need to grab every image URL from them.

The rest of the content varies, but the base of the URL is always the same, for example: http://images.examplesite.com/images/

So I want to pull whatever follows that base out of the string. The problem is that the URLs are always mixed in with <a href=''> or <img src=''> tags, so how can I get just the URLs out? Maybe preg_match?

Try something like: preg_match_all('/http:\/\/images\.examplesite\.com\/images\/(.*?)"/i', $html_data, $result, PREG_SET_ORDER);
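To make the suggestion above concrete, here is a minimal sketch of the same preg_match_all idea wrapped in a function. The function name extract_image_urls and the sample HTML are made up for illustration, and the character class is my own assumption: it stops at either quote style, since the question shows single-quoted attributes while the pattern above only stops at a double quote.

```php
<?php
// Hypothetical helper (not from the original answer): collect every URL
// in $html that starts with the fixed base URL $base.
function extract_image_urls(string $html, string $base): array
{
    // preg_quote() escapes regex metacharacters in the base URL;
    // '/' is passed as the delimiter so slashes get escaped too.
    // The URL is taken to end at a quote, whitespace, or angle bracket.
    $pattern = '/' . preg_quote($base, '/') . '[^"\'\s<>]*/i';

    preg_match_all($pattern, $html, $matches);

    // $matches[0] holds the full matches; drop duplicates and reindex.
    return array_values(array_unique($matches[0]));
}

// Made-up sample input mixing <a href> and <img src> tags.
$html_data = '<a href="http://images.examplesite.com/images/a.jpg">link</a>'
           . "<img src='http://images.examplesite.com/images/b.png'>"
           . '<a href="http://other.example.com/c.gif">other</a>';

$urls = extract_image_urls($html_data, 'http://images.examplesite.com/images/');
print_r($urls);
```

Because the match is anchored on the base URL rather than on a particular tag, it should not matter whether a URL sits in an href or a src attribute. If you only want the part after the base, wrap the character class in a capture group and read $matches[1] instead.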

