
PHP script to crawl URLs retrieved from xml files

Hi, I have collected a number of XML files from RSS feeds. I will extract the useful fields (e.g., title, URL link, …) from these XML files and store them in a MySQL database. However, the content/description fields in the XML files are only summaries, and what I want is the full text of each entry. So I am thinking of using a PHP script to crawl the URLs retrieved from the XML files and then extract the full text. What should the logic/algorithm of this PHP script be?

Thank you very much.

PHP

3 Comment(s)

@LiLcRaZyFuZzY, Jul 18, 2006: What version of PHP do you have?
@yunfanny (author), Jul 18, 2006: Both PHP 4 and PHP 5. I can switch from one version to the other.
@LiLcRaZyFuZzY, Jul 18, 2006: Did you have a look at http://php.net/simplexml? (It requires PHP 5.)
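
Building on the SimpleXML suggestion, the overall logic could look roughly like this: parse each feed file, collect the item links, fetch every linked page, and reduce the HTML to plain text before storing it. The sketch below is only one possible shape, not a definitive implementation. The function names (`extract_feed_links`, `fetch_page`, `html_to_text`, `crawl_feed`) are made up for illustration, the feed is assumed to follow the usual RSS 2.0 layout (`channel` / `item` / `link`), and `fetch_page` assumes `allow_url_fopen` is enabled. Taking the whole `<body>` text is a crude stand-in for real article extraction, which usually needs per-site rules.

```php
<?php
// Illustrative sketch only: all function names here are hypothetical.
// Assumes a PHP version with the SimpleXML and DOM extensions available.

/** Return title/link pairs for every <item> in an RSS 2.0 document. */
function extract_feed_links($rssXml)
{
    libxml_use_internal_errors(true);          // keep parse warnings quiet
    $feed = simplexml_load_string($rssXml);
    libxml_clear_errors();
    if ($feed === false || !isset($feed->channel->item)) {
        return array();                        // not parseable, or no items
    }
    $entries = array();
    foreach ($feed->channel->item as $item) {
        $entries[] = array(
            'title' => trim((string) $item->title),
            'link'  => trim((string) $item->link),
        );
    }
    return $entries;
}

/** Download a page; returns null on failure. Assumes allow_url_fopen is on. */
function fetch_page($url, $timeoutSeconds = 10)
{
    $context = stream_context_create(array(
        'http' => array('timeout' => $timeoutSeconds),
    ));
    $html = @file_get_contents($url, false, $context);
    return $html === false ? null : $html;
}

/** Naive text extraction: drop <script>/<style>, keep the body text. */
function html_to_text($html)
{
    if (trim($html) === '') {
        return '';
    }
    libxml_use_internal_errors(true);          // real-world HTML is rarely valid
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    foreach (array('script', 'style') as $tag) {
        $nodes = $doc->getElementsByTagName($tag);
        // Walk backwards because the node list is live and shrinks on removal.
        for ($i = $nodes->length - 1; $i >= 0; $i--) {
            $node = $nodes->item($i);
            $node->parentNode->removeChild($node);
        }
    }
    $root = $doc->getElementsByTagName('body')->item(0);
    if ($root === null) {
        $root = $doc->documentElement;
    }
    if ($root === null) {
        return '';
    }
    // Collapse runs of whitespace into single spaces.
    return trim(preg_replace('/\s+/', ' ', $root->textContent));
}

/** Tie the steps together: one result row per feed item. */
function crawl_feed($rssXml)
{
    $rows = array();
    foreach (extract_feed_links($rssXml) as $entry) {
        $html = fetch_page($entry['link']);
        $entry['full_text'] = $html === null ? null : html_to_text($html);
        $rows[] = $entry;                      // insert into MySQL at this point
    }
    return $rows;
}
```

Something like `crawl_feed(file_get_contents('feed.xml'))` (with `feed.xml` standing in for one of the saved feed files) would then give one row per entry, ready to insert into the database. Note that `DOMDocument::loadHTML` guesses the character encoding from the page, so pages without a charset declaration may need converting first.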