Rule Based Metadata Extraction Framework from Academic Articles

This article proposes a free, open-source Java-based metadata extraction framework that uses layout and rule-based methods to extract essential metadata from PDFs, including titles, abstracts, keywords, and references. It emphasizes the speed and accuracy of this framework, making it suitable for digital libraries and research databases. The article provides an in-depth review of the framework’s capabilities, highlighting its precision in extracting metadata from scientific documents. By using this framework, organizations can improve the efficiency of their metadata extraction processes, enabling better organization, retrieval, and analysis of research data.

Author(s) :

Azimjonov, J., Alikhanov, J.

Yes

Get in touch with authors

No ratings yet

Rate this article

Yes

Key topics

Data Science for social impact

Also found in

Share

Join Our Newsletter

Explore More Articles

In this age of AI, India’s Women Are Being Left Behind inSTEM and Skilling

‘In Fact’ is a quarterly newsletter by ISDM DataShakti. ISDM DataShakti, powered by Capgemini, is a pioneering single-window SDG data platform that makes SDG data easily accessible to social sector professionals like you, so you can focus on creating change on the ground.
Blog

An Urgent Call for Digital Literacy

‘In Fact’ is a quarterly newsletter by ISDM DataShakti. ISDM DataShakti, powered by Capgemini, is a pioneering single-window SDG data platform that makes SDG data easily accessible to social sector professionals like you, so you can focus on creating change on the ground.
Blog

Why India needs to start washing its hands more

‘In Fact’ is a quarterly newsletter by ISDM DataShakti. ISDM DataShakti, powered by Capgemini, is a pioneering single-window SDG data platform that makes SDG data easily accessible to social sector professionals like you, so you can focus on creating change on the ground.
Blog

Double trouble: Why India urgently needs policies to address the challenges of bothits youth, and elderly population

‘In Fact’ is a quarterly newsletter by ISDM DataSights. ISDM DataSights is a pioneering single-window SDG data platform that democratises data access for the social sector, developed by the Indian School of Development Management (ISDM), and powered by Capgemini.
We use essential and analytics cookies to operate this website and understand how visitors interact with it. As this site also functions as a login identity provider (IDP) for other ISDM portals, some cookies are necessary to enable secure authentication. By continuing to use this site, you consent to our use of cookies.