Dr. Inventor Text Mining Framework

Java library to bootstrap and support scientific publication mining

Natural Language Processing Group (TALN)
Universitat Pompeu Fabra, Barcelona, Spain
Developed in the context of the Dr. Inventor Project

To get the latest version of the library (code, documentation, examples), go to:
Dr. Inventor Text Mining Framework 4.0 (10/04/2017)

As of version 3.1, the Dr. Inventor Text Mining Framework documentation has moved to the ReadTheDocs portal.



The Dr. Inventor Text Mining Framework is a Java library that integrates several Document Engineering and Natural Language Processing tools, customized to enable and ease the analysis of the textual contents of scientific publications.
The Dr. Inventor Text Mining Framework is a standalone Java library that enables users to process the contents of papers in both PDF and JATS XML format. Once a paper has been imported from a local file or a remote URL, the Framework automatically extracts and characterizes several aspects of it. All these facets of a paper are automatically mined and can be easily accessed through the set of methods and classes defined by the Dr. Inventor Text Mining Framework.


The Dr. Inventor Text Mining Framework is developed by TALN-UPF in the context of the Dr. Inventor Project.
Are you using the Dr. Inventor Framework to support a scientific publication analysis task?
Please let us know by sending an email to: francesco.ronzano AT upf.edu.

To cite the Dr. Inventor Framework:
Ronzano, F., & Saggion, H. (2015). Dr. Inventor Framework: Extracting Structured Information from Scientific Publications. In Discovery Science (pp. 209-220). Springer International Publishing.