pyexcel-cli - Let you focus on data at command line, instead of file formats
April 18, 2025 ยท View on GitHub
================================================================================ pyexcel-cli - Let you focus on data at command line, instead of file formats
.. image:: https://raw.githubusercontent.com/pyexcel/pyexcel.github.io/master/images/patreon.png :target: https://www.patreon.com/chfw
.. image:: https://raw.githubusercontent.com/pyexcel/pyexcel-mobans/master/images/awesome-badge.svg :target: https://awesome-python.com/#specific-formats-processing
.. image:: https://codecov.io/gh/pyexcel/pyexcel-cli/branch/master/graph/badge.svg :target: https://codecov.io/gh/pyexcel/pyexcel-cli
.. image:: https://badge.fury.io/py/pyexcel-cli.svg :target: https://pypi.org/project/pyexcel-cli
.. image:: https://pepy.tech/badge/pyexcel-cli/month :target: https://pepy.tech/project/pyexcel-cli
.. image:: https://img.shields.io/gitter/room/gitterHQ/gitter.svg :target: https://gitter.im/pyexcel/Lobby
.. image:: https://img.shields.io/static/v1?label=continuous%20templating&message=%E6%A8%A1%E7%89%88%E6%9B%B4%E6%96%B0&color=blue&style=flat-square :target: https://moban.readthedocs.io/en/latest/#at-scale-continous-templating-for-open-source-projects
.. image:: https://img.shields.io/static/v1?label=coding%20style&message=black&color=black&style=flat-square :target: https://github.com/psf/black .. image:: https://readthedocs.org/projects/pyexcel-cli/badge/?version=latest :target: http://pyexcel-cli.readthedocs.org/en/latest/
Support the project
If your company uses pyexcel and its components in a revenue-generating product,
please consider supporting the project on GitHub or
Patreon <https://www.patreon.com/bePatron?u=5537627>_. Your financial
support will enable me to dedicate more time to coding, improving documentation,
and creating engaging content.
Known constraints
Fonts, colors and charts are not supported.
Nor to read password protected xls, xlsx and ods files.
Introduction
pyexcel-cli brings pyexcel <https://github.com/pyexcel/pyexcel>_ to make it easy
to consume/produce information stored in excel files on command line interface.
This library can turn the excel data into a list of lists, a list of records(dictionaries),
dictionaries of lists. And vice versa. Hence it lets you focus on data in shell
programming, instead of file formats.
Hightlighted features:
#. View data in the excel files without Microsoft Office or Open Office #. Transcode data among supported excel file formats #. Merge files in various excel file formats into one #. Split a multi-sheet excel file into single sheet files #. Find difference in data between two excel files
Usage
.. code-block:: bash
pyexcel view --in-browser --output-file-type sortable.html --sheet-index 0 https://github.com/pyexcel/excel2table/raw/master/sample/goog.ods
Here's what you will get:
.. image:: https://github.com/pyexcel/pyexcel-cli/raw/master/pyexcel-cli-sortable.gif
.. note::
You will need to install pyexcel-sortable, which renders it.
Here is another cli example usage:
.. code-block:: bash
$ pyexcel view https://github.com/pyexcel/pyexcel-cli/blob/master/tests/fixtures/multiple-sheets.xls
Sheet 1:
+---+---+---+
| 1 | 2 | 3 |
+---+---+---+
| 4 | 5 | 6 |
+---+---+---+
| 7 | 8 | 9 |
+---+---+---+
Sheet 2:
+---+---+---+
| X | Y | Z |
+---+---+---+
| 1 | 2 | 3 |
+---+---+---+
| 4 | 5 | 6 |
+---+---+---+
Sheet 3:
+---+---+---+
| O | P | Q |
+---+---+---+
| 3 | 2 | 1 |
+---+---+---+
| 4 | 3 | 2 |
+---+---+---+
Because pyexcel family is loosely coupled, especially for file format supports, you install the libraries that you need to. If you need to support xls format, you will need to install pyexcel-xls. For more information, please see the plugin section.
.. _file-format-list: .. _a-map-of-plugins-and-file-formats:
.. table:: A list of file formats supported by external plugins
======================== ======================= =================
Package name Supported file formats Dependencies
======================== ======================= =================
pyexcel-io_ csv, csvz [#f1], tsv, csvz,tsvz readers depends on chardet
tsvz [#f2]
pyexcel-xls_ xls, xlsx(read only), xlrd,
xlsm(read only) xlwt
pyexcel-xlsx_ xlsx openpyxl_
pyexcel-ods3_ ods pyexcel-ezodf,
lxml
pyexcel-ods ods odfpy_
======================== ======================= =================
.. table:: Dedicated file reader and writers
======================== ======================= =================
Package name Supported file formats Dependencies
======================== ======================= =================
pyexcel-xlsxw_ xlsx(write only) XlsxWriter_
pyexcel-libxlsxw_ xlsx(write only) libxlsxwriter_
pyexcel-xlsxr_ xlsx(read only) lxml
pyexcel-xlsbr_ xlsb(read only) pyxlsb
pyexcel-odsr_ read only for ods, fods lxml
pyexcel-odsw_ write only for ods loxun
pyexcel-htmlr_ html(read only) lxml,html5lib
pyexcel-pdfr_ pdf(read only) camelot
======================== ======================= =================
Plugin shopping guide
Since 2020, all pyexcel-io plugins have dropped the support for python versions which are lower than 3.6. If you want to use any of those Python versions, please use pyexcel-io and its plugins versions that are lower than 0.6.0.
Except csv files, xls, xlsx and ods files are a zip of a folder containing a lot of xml files
The dedicated readers for excel files can stream read
In order to manage the list of plugins installed, you need to use pip to add or remove a plugin. When you use virtualenv, you can have different plugins per virtual environment. In the situation where you have multiple plugins that does the same thing in your environment, you need to tell pyexcel which plugin to use per function call. For example, pyexcel-ods and pyexcel-odsr, and you want to get_array to use pyexcel-odsr. You need to append get_array(..., library='pyexcel-odsr').
.. _pyexcel-io: https://github.com/pyexcel/pyexcel-io .. _pyexcel-xls: https://github.com/pyexcel/pyexcel-xls .. _pyexcel-xlsx: https://github.com/pyexcel/pyexcel-xlsx .. _pyexcel-ods: https://github.com/pyexcel/pyexcel-ods .. _pyexcel-ods3: https://github.com/pyexcel/pyexcel-ods3 .. _pyexcel-odsr: https://github.com/pyexcel/pyexcel-odsr .. _pyexcel-odsw: https://github.com/pyexcel/pyexcel-odsw .. _pyexcel-pdfr: https://github.com/pyexcel/pyexcel-pdfr
.. _pyexcel-xlsxw: https://github.com/pyexcel/pyexcel-xlsxw .. _pyexcel-libxlsxw: https://github.com/pyexcel/pyexcel-libxlsxw .. _pyexcel-xlsxr: https://github.com/pyexcel/pyexcel-xlsxr .. _pyexcel-xlsbr: https://github.com/pyexcel/pyexcel-xlsbr .. _pyexcel-htmlr: https://github.com/pyexcel/pyexcel-htmlr
.. _xlrd: https://github.com/python-excel/xlrd .. _xlwt: https://github.com/python-excel/xlwt .. _openpyxl: https://bitbucket.org/openpyxl/openpyxl .. _XlsxWriter: https://github.com/jmcnamara/XlsxWriter .. _pyexcel-ezodf: https://github.com/pyexcel/pyexcel-ezodf .. _odfpy: https://github.com/eea/odfpy .. _libxlsxwriter: http://libxlsxwriter.github.io/getting_started.html
.. table:: Other data renderers
======================== ======================= ================= ==================
Package name Supported file formats Dependencies Python versions
======================== ======================= ================= ==================
pyexcel-text_ write only:rst, tabulate_ 2.6, 2.7, 3.3, 3.4
mediawiki, html, 3.5, 3.6, pypy
latex, grid, pipe,
orgtbl, plain simple
read only: ndjson
r/w: json
pyexcel-handsontable_ handsontable in html handsontable_ same as above
pyexcel-pygal_ svg chart pygal_ 2.7, 3.3, 3.4, 3.5
3.6, pypy
pyexcel-sortable_ sortable table in html csvtotable_ same as above
pyexcel-gantt_ gantt chart in html frappe-gantt_ except pypy, same
as above
======================== ======================= ================= ==================
.. _pyexcel-text: https://github.com/pyexcel/pyexcel-text .. _tabulate: https://bitbucket.org/astanin/python-tabulate .. _pyexcel-handsontable: https://github.com/pyexcel/pyexcel-handsontable .. _handsontable: https://cdnjs.com/libraries/handsontable .. _pyexcel-pygal: https://github.com/pyexcel/pyexcel-chart .. _pygal: https://github.com/Kozea/pygal .. _pyexcel-matplotlib: https://github.com/pyexcel/pyexcel-matplotlib .. _matplotlib: https://matplotlib.org .. _pyexcel-sortable: https://github.com/pyexcel/pyexcel-sortable .. _csvtotable: https://github.com/vividvilla/csvtotable .. _pyexcel-gantt: https://github.com/pyexcel/pyexcel-gantt .. _frappe-gantt: https://github.com/frappe/gantt
.. rubric:: Footnotes
.. [#f1] zipped csv file .. [#f2] zipped tsv file
Installation
You can install pyexcel-cli via pip:
.. code-block:: bash
$ pip install pyexcel-cli
or clone it and install it:
.. code-block:: bash
$ git clone https://github.com/pyexcel/pyexcel-cli.git
$ cd pyexcel-cli
$ python setup.py install
Development guide
Development steps for code changes
#. git clone https://github.com/pyexcel/pyexcel-cli.git #. cd pyexcel-cli
Upgrade your setup tools and pip. They are needed for development and testing only:
#. pip install --upgrade setuptools pip
Then install relevant development requirements:
#. pip install -r rnd_requirements.txt # if such a file exists #. pip install -r requirements.txt #. pip install -r tests/requirements.txt
Once you have finished your changes, please provide test case(s), relevant documentation and update changelog.yml
.. note::
As to rnd_requirements.txt, usually, it is created when a dependent
library is not released. Once the dependency is installed
(will be released), the future
version of the dependency in the requirements.txt will be valid.
How to test your contribution
Although nose and doctest are both used in code testing, it is advisable
that unit tests are put in tests. doctest is incorporated only to make sure
the code examples in documentation remain valid across different development
releases.
On Linux/Unix systems, please launch your tests like this::
$ make
On Windows, please issue this command::
> test.bat
Before you commit
Please run::
$ make format
so as to beautify your code otherwise your build may fail your unit test.
License
New BSD License