Github url

word_cloud

by amueller

amueller /word_cloud

A little word cloud generator in Python

7.3K Stars 1.9K Forks Last release: about 2 years ago (1.5.0) MIT License 535 Commits 9 Releases

Available items

No Items, yet!

The developer of this repository has not created any items for sale yet. Need a bug fixed? Help with integration? A different license? Create a request here:

licence DOI

| | Linux | macOS | Windows | |------|----------------------------------------------|----------------------------------------------|----------------------------------------------| | PyPI | CircleCI | TravisCI | AppVeyor |

word_cloud

A little word cloud generator in Python. Read more about it on the blog post or the website.

The code is tested against Python 2.7, 3.4, 3.5, 3.6 and 3.7.

Installation

If you are using pip:

pip install wordcloud

If you are using conda, you can install from the

conda-forge
channel:
conda install -c conda-forge wordcloud

Installation notes

wordcloud depends on

numpy
and
pillow
.

To save the wordcloud into a file,

matplotlib
can also be installed. See examples below.

If there are no wheels available for your version of python, installing the package requires having a C compiler set up. Before installing a compiler, report an issue describing the version of python and operating system being used.

Examples

Check out examples/simple.py for a short intro. A sample output is:

Constitution

Or run examples/masked.py to see more options. A sample output is:

Alice in Wonderland

Getting fancy with some colors: Parrot with rainbow colors

Generating wordclouds for Arabic:

Arabic wordlcloud

Command-line usage

The

wordcloud_cli
tool can be used to generate word clouds directly from the command-line:
$ wordcloud_cli --text mytext.txt --imagefile wordcloud.png

If you're dealing with PDF files, then

pdftotext
, included by default with many Linux distribution, comes in handy:
$ pdftotext mydocument.pdf - | wordcloud_cli --imagefile wordcloud.png

In the previous example, the

-
argument orders
pdftotext
to write the resulting text to stdout, which is then piped to the stdin of
wordcloud_cli.py
.

Use

wordcloud_cli --help
so see all available options.

Licensing

The wordcloud library is MIT licenced, but contains DroidSansMono.ttf, a true type font by Google, that is apache licensed. The font is by no means integral, and any other font can be used by setting the

font_path
variable when creating a
WordCloud
object.

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.