8000 GitHub - fandigunawan/ODTReader: Lightweight python module to allow extracting text from OpenDocument (odt) files.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

fandigunawan/ODTReader

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ODTReader

Lightweight python module for extracting raw text from OpenDocument (odt) files.

Linux, macOS and Windows platforms supported.

Just point at the file with the .odt extension and let it print it out for you.

Installation

$ pip install odtreader

Usage

You can simply use it by calling the odtToText() function. The file is parsed and the text returned to you as a unicode object.

Example:

from ODTReader.odtreader import odtToText

text = odtToText("path/to/file.odt")

It can also be used as a command line utility.

Example:

$ python odtreader.py path/to/file.odt
This is the contents of the odt file!

$ python odtreader.py path/to/file.odt -o outfile.txt
Contents written to 'outfile.txt'

Version Support

This module was tested on Python 2.7 and 3.6 as of now, although later versions should hopefully work.

License

GNU GPL v3.0 License, see LICENSE file.

About

Lightweight python module to allow extracting text from OpenDocument (odt) files.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%
0