
I want to create an offline app to show drug information


Question

I want to create an offline app to show drug information. The information I need is already available on Wikipedia, but the page in question has so many subpages (1000+) that saving them manually is not feasible.

Also note that I need to save each drug as a formatted XML element (conforming to a certain schema). I'm looking for a tool to facilitate this task. Does such a tool exist?

My Requirements

Takes one or more HTML pages and creates an XML document
Preferably free
Windows or Linux based

Explanation / Answer

I doubt you will find an off-the-shelf tool for this purpose; IMHO the requirement is too specific. But there are lots of frameworks for every major programming language that help you implement a web "scraper" or "crawler" yourself.

For example, googling for "python web crawler" immediately turned up http://scrapy.org/, and searching for "java web crawler" gave a link to crawler4j. Using such a framework, for someone with >4K points on Stack Overflow it should be a matter of a few days at most to implement what you are looking for.
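
To give a rough idea of the HTML-to-XML step, here is a minimal sketch that uses only Python's standard library (html.parser and xml.etree.ElementTree) rather than a full crawling framework. It assumes, purely for illustration, that the drug name is in the first h1 of a page and the summary in the first p. The element names drug, name and summary are hypothetical stand-ins for your real schema, and html_to_drug_xml is a made-up helper name. Downloading the 1000+ pages is left out; that is the part a framework like Scrapy would handle.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class DrugPageParser(HTMLParser):
    """Collects the text of the first <h1> and the first <p> on a page."""

    def __init__(self):
        super().__init__()
        self.fields = {"h1": "", "p": ""}
        self._current = None  # tag whose text is currently being captured
        self._done = set()    # tags already captured once

    def handle_starttag(self, tag, attrs):
        # Start capturing only for the first occurrence of each wanted tag.
        if tag in self.fields and tag not in self._done and self._current is None:
            self._current = tag

    def handle_endtag(self, tag):
        if tag == self._current:
            self._done.add(tag)
            self._current = None

    def handle_data(self, data):
        # Text inside nested inline tags (<b>, <i>, ...) is captured as well.
        if self._current is not None:
            self.fields[self._current] += data


def html_to_drug_xml(html):
    """Build a <drug> element (hypothetical schema) from one HTML page."""
    parser = DrugPageParser()
    parser.feed(html)
    parser.close()
    drug = ET.Element("drug")
    ET.SubElement(drug, "name").text = parser.fields["h1"].strip()
    ET.SubElement(drug, "summary").text = parser.fields["p"].strip()
    return drug


sample = "<html><body><h1>Aspirin</h1><p>Aspirin is a <b>pain</b> reliever.</p></body></html>"
print(ET.tostring(html_to_drug_xml(sample), encoding="unicode"))
```

Real Wikipedia pages are far messier than this sample, so the extraction rules would need to target the specific infobox or section markup you care about, and the resulting XML should be validated against your schema.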
