Lightweight python module for extracting raw text from OpenDocument (odt) files.
Linux, macOS and Windows platforms supported.
Just point at the file with the .odt
extension and let it print it out for you.
$ pip install odtreader
You can simply use it by calling the odtToText()
function. The file is parsed and the text returned to you as a unicode
object.
Example:
from ODTReader.odtreader import odtToText
text = odtToText("path/to/file.odt")
It can also be used as a command line utility.
Example:
$ python odtreader.py path/to/file.odt This is the contents of the odt file! $ python odtreader.py path/to/file.odt -o outfile.txt Contents written to 'outfile.txt'
This module was tested on Python 2.7 and 3.6 as of now, although later versions should hopefully work.
GNU GPL v3.0 License, see LICENSE file.