8000 GitHub - evaldask/lltk: Lithuanian Language ToolKit
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

evaldask/lltk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build St
7D45
atus Coverage Status Maintainability

LLTK

Lithuanian Language ToolKit

This is Python3 library for Lithuanian language analysis. Currently these functions are supported:

  • word stemming,
  • paragraph splitting into sentences,
  • text summarization based on smmry.

For summarization example please checkout example jupyter notebook.

Stemming and splitting is based on TokenMill work. So big thumbs up for these guys for opensourcing their tools.

Installing for development

  • Fork the project
  • Clone your forked project:
$ git clone git@github.com:evalkaz/lltk.git
  • Install development dependencies:
$ pip install -r requirements-dev.txt

Running tests

After cloned and installed development dependencies run tests just typing the following command:

$ pytest

Any contribution is welcome!

About

Lithuanian Language ToolKit

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  
0