I want to create an offline app to show drugs information. The information I nee
ID: 657503 • Letter: I
Question
I want to create an offline app to show drugs information. The information I need to use is already available in wikipedia. But at this page, there are so many subpages (1000+) to be saved and in my opinion doing this task manually is not feasible.
Also note that I should save each drug in a formatted element in XML (with certain schema). Now I'm looking for a tool to facilitate this task. Does such a tool exist at all or not?
My Requirements
Gets a HTML page(s) and creates an XML document
Preferably free
Windows or Linux based
Explanation / Answer
I doubt you will find an off-the-shelf tool for this purpose, this is IMHO a too specific requirement. But there a lots of frameworks for each major programming language which help you to implement a web "scraper" or "crawler" by yourself.
For example, googling for "python web crawler" immediately showed up http://scrapy.org/, looking for "java web crawler" gave a link to crawler4j. Using such a framework, for someone with >4K points on stackoverflow it should be a matter of a few days at most to implement what you are looking for.
Related Questions
drjack9650@gmail.com
Navigate
Integrity-first tutoring: explanations and feedback only — we do not complete graded work. Learn more.