text-parsing.md
July 15, 2021 · View on GitHub
Bookmarks tagged [text-parsing]
www.codever.land/bookmarks/t/text-parsing
tablib
https://github.com/kennethreitz/tablib
A module for Tabular Datasets in XLS, CSV, JSON, YAML.
- tags: python, text-parsing
- :octocat: source code
openpyxl
https://openpyxl.readthedocs.io/en/stable/
A library for reading and writing Excel 2010 xlsx/xlsm/xltx/xltm files.
- tags: python, text-parsing, office
pyexcel
https://github.com/pyexcel/pyexcel
Providing one API for reading, manipulating and writing csv, ods, xls, xlsx and xlsm files.
- tags: python, text-parsing, office
- :octocat: source code
python-docx
https://github.com/python-openxml/python-docx
Reads, queries and modifies Microsoft Word 2007/2008 docx files.
- tags: python, text-parsing, office
- :octocat: source code
python-pptx
https://github.com/scanny/python-pptx
Python library for creating and updating PowerPoint (.pptx) files.
- tags: python, text-parsing, office
- :octocat: source code
unoconv
https://github.com/unoconv/unoconv
Convert between any document format supported by LibreOffice/OpenOffice.
- tags: python, text-parsing, office
- :octocat: source code
XlsxWriter
https://github.com/jmcnamara/XlsxWriter
A Python module for creating Excel .xlsx files.
- tags: python, text-parsing, office
- :octocat: source code
xlwings
https://github.com/ZoomerAnalytics/xlwings
A BSD-licensed library that makes it easy to call Python from Excel and vice versa.
- tags: python, text-parsing, office
- :octocat: source code
xlwt
https://github.com/python-excel/xlwt
Writing and reading data and formatting information from Excel files.
- tags: python, text-parsing, office
- :octocat: source code
PDFMiner
https://github.com/euske/pdfminer
A tool for extracting information from PDF documents.
- tags: python, text-parsing, pdf
- :octocat: source code
PyPDF2
https://github.com/mstamy2/PyPDF2
A library capable of splitting, merging and transforming PDF pages.
- tags: python, text-parsing, pdf
- :octocat: source code
ReportLab
https://www.reportlab.com/opensource/
Allowing Rapid creation of rich PDF documents.
- tags: python, text-parsing, pdf
Mistune
https://github.com/lepture/mistune
Fastest and full featured pure Python parsers of Markdown.
- tags: python, text-parsing, markdown
- :octocat: source code
Python-Markdown
https://github.com/waylan/Python-Markdown
A Python implementation of John Gruber’s Markdown.
- tags: python, text-parsing, markdown
- :octocat: source code
PyYAML
YAML implementations for Python.
- tags: python, text-parsing, yaml
csvkit
https://github.com/wireservice/csvkit
Utilities for converting to and working with CSV.
- tags: python, text-parsing, csv
- :octocat: source code
unp
https://github.com/mitsuhiko/unp
A command line tool that can unpack archives easily.
- tags: python, text-parsing, archive
- :octocat: source code