Java HR Information Extractor

The goal of this project is to create a program that can automatically extract relevant information from a document.

Information extraction (IE) is the task of automatically extracting structured information from unstructured or semi-structured machine-readable documents. In most cases, this means processing human-language text with natural language processing (NLP).

Because general-purpose information extraction is a hard problem, this project focuses on a single domain: Human Resources.

The program is written in Java; a React frontend is optional. This is a cross-departmental project in which HR Officers work alongside Java Developers and Frontend Developers to build a domain-specific program.
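To make the idea of turning unstructured HR text into structured data more concrete, here is a minimal Java sketch. It is only an illustration under stated assumptions: the class name HrFieldExtractor, the field names (email, startDate, jobTitle), and the regular expressions are all hypothetical and not part of the project specification. A real extractor would likely need an NLP library and rules supplied by the HR Officers.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical example class: pulls a few HR-related fields out of free text.
final class HrFieldExtractor {

    // Assumed patterns for illustration only; real HR documents vary widely.
    private static final Pattern EMAIL =
            Pattern.compile("[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}");
    private static final Pattern START_DATE = Pattern.compile(
            "(?i)start(?:s|ing)?\\s+(?:date\\s*:?\\s*|on\\s+)(\\d{4}-\\d{2}-\\d{2})");
    private static final Pattern JOB_TITLE =
            Pattern.compile("(?i)(?:position|job title)\\s*:\\s*([^\\n.]+)");

    private HrFieldExtractor() {
    }

    // Returns the fields that were found, in a fixed order; missing fields are omitted.
    static Map<String, String> extract(String text) {
        Map<String, String> fields = new LinkedHashMap<>();
        putIfFound(fields, "email", EMAIL, 0, text);
        putIfFound(fields, "startDate", START_DATE, 1, text);
        putIfFound(fields, "jobTitle", JOB_TITLE, 1, text);
        return fields;
    }

    // Stores the first match of the given capture group under the given key.
    private static void putIfFound(Map<String, String> fields, String key,
                                   Pattern pattern, int group, String text) {
        Matcher matcher = pattern.matcher(text);
        if (matcher.find()) {
            fields.put(key, matcher.group(group).trim());
        }
    }

    public static void main(String[] args) {
        String letter = "Position: Java Software Developer\n"
                + "Contact Jane at jane.doe@example.com. Starting on 2024-03-01.";
        System.out.println(extract(letter));
    }
}
```

The result is a plain map of field names to values, which a Java backend could serialize and hand to an optional React frontend. Regular expressions were chosen here only because they keep the sketch self-contained; they are not a substitute for NLP on less regular text.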

Source: https://en.wikipedia.org/wiki/Information_extraction

Keywords

documents, extracting, Frontend developer, human resource management, human resources officer, information extraction, Java, Java Software Developer, machine-readable, Natural language processing, React, semi-structured, Structured data, unstructured data