tcorpus

tcorpus is a collection of high-level tools for text corpus preparation and discourse analysis. It is being developed and used for research at the chair of Economic Geography and Sustainable Development, University of Freiburg. Things may change and break regularly, but you are welcome to see if any of it is useful.

The package relies on several dependencies for natural language processing tasks. Among others, it uses:

flair for named entity recognition
syntok for segmentation and tokenization
NLTK for parsing grammatical structures

While tcorpus is free to use and distribute under an MIT License, this may not be the case for all dependencies. Please check whether the dependency licenses cover your use case.

wiertz/tcorpus