I think it should be primary, but I still can not find my head around it Suppose HTML documents have the proper amount of documents and I have to catch every image URL from them.
The remaining content changes, but the basis of the URL is always for example: Http://images.examplesite.com/images/ ,
& lt; A href = '' & gt; or & lt; Img src = '' & gt; are mixed with tags, so how can I leave them out? Maybe preg_match ?
Try something like: preg_match_all ('/ Http: \ / \ / images \ .examplesite \ .com \ / images \ / (. *?) "/ I ', $ html_data, $ result, PREG_SET_ORDER)
Comments
Post a Comment