Human Data Attribute Extraction App

This application is designed to extract specific data attributes related to humans from one or multiple DOCX documents. It utilizes the power of ChatGPT, an AI language model, to analyze the documents and identify relevant information. The app also ensures privacy by replacing the actual user information with masked text or Ethereum wallet IDs.

Requirments

Python 3.10 or Latest are required
Please insitall requirment_1.txt

pip install requirments_1.txt

An Healthy Brain :)

Instructions

Before running the application, please make the following changes:

1. Open the utils/const.py file in a text editor.
2. Locate the line INPUT_PATH = 'test/' and modify the value to the path where your DOCX files are located. This path will be the input directory from which the application will read the documents.
3. Find the line API_KEY = 'XXXXXXX' in the same file (utils/const.py) and replace 'XXXXXXX' with your actual ChatGPT API key. This key is required to connect to the ChatGPT service and perform language processing tasks.

It is essential to upload the DOCX files from the specified INPUT_PATH directory because the application uses Streamlit, a web application framework. Streamlit requires the files to be available in the same path specified during configuration.

Running the Application

To run the application and start the human data attribute extraction process, execute the following command in your terminal:

streamlit run main.py

This command will launch the application, and you can interact with it through your web browser.

Demo

Please watch the demo.webm

Credits

This application was developed by Mohammad Ali Abbas at waspak.co for Holland (A Fiverr Client).

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
test		test
utils		utils
.gitignore		.gitignore
Demo.webm		Demo.webm
Driver.py		Driver.py
FillesDB.py		FillesDB.py
GPTInfoExtractor.py		GPTInfoExtractor.py
InfoExtractor.py		InfoExtractor.py
Masker.py		Masker.py
Readme.md		Readme.md
WordProcessor.py		WordProcessor.py
__init__.py		__init__.py
data.csv		data.csv
data_enums.xlsx		data_enums.xlsx
default_enum.ipynb		default_enum.ipynb
default_enum_db.csv		default_enum_db.csv
main.py		main.py
requirements_1.txt		requirements_1.txt
waspak.jpg		waspak.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Human Data Attribute Extraction App

Requirments

Instructions

Running the Application

Demo

Credits

About

Uh oh!

Releases

Packages

Uh oh!

Languages

m-aliabbas/NER_Project

Folders and files

Latest commit

History

Repository files navigation

Human Data Attribute Extraction App

Requirments

Instructions

Running the Application

Demo

Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages