data⎰describe: Pythonic EDA Accelerator for Data Science


data ⎰ describe

data-describe is a Python toolkit for Exploratory Data Analysis (EDA). It aims to accelerate data exploration and analysis by providing automated and polished analysis widgets.

For more examples of data-describe in action, see the Quick Start Tutorial.

Main Features

data-describe implements the following basic features:

| Feature | Description |
| ----------- | ----------- |
| Data Summary | Curated data summary |
| Data Heatmap | Data variation and missingness heatmap |
| Correlation Matrix | Correlation heatmaps with categorical support |
| Distribution Plots | Generate histograms, violin plots, bar charts |
| Scatterplots | Generate scatterplots and evaluate with scatterplot diagnostics |
| Cluster Analysis | Automated clustering and plotting |
| Feature Ranking | Evaluate feature importance using tree models |

Extended Features

data-describe aims to raise the standard for Exploratory Data Analysis. Here are a few of the extended features already implemented:

  • Dimensionality Reduction Methods
  • Sensitive Data (PII) Redaction
  • Text Pre-processing / Topic Modeling
  • Big Data Support


Installation

data-describe can be installed using pip:

pip install data-describe

Getting Started

import data_describe as dd

See the User Guide for more information.
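
Once the package is installed, one way to get oriented is to list the public names it exposes. The following is a minimal sketch using only the Python standard library; the helper `public_names` is a hypothetical name introduced for illustration and is not part of data-describe. It makes no assumptions about which functions the package provides.

```python
import importlib
import importlib.util
from typing import List, Optional


def public_names(module_name: str) -> Optional[List[str]]:
    """Return the sorted public attribute names of a module.

    Returns None if the module is not installed. This is a hypothetical
    helper for illustration, not part of data-describe.
    """
    # find_spec returns None when a top-level module cannot be located
    if importlib.util.find_spec(module_name) is None:
        return None
    module = importlib.import_module(module_name)
    # Names starting with an underscore are private by convention
    return sorted(name for name in dir(module) if not name.startswith("_"))


names = public_names("data_describe")
if names is None:
    print("data-describe is not installed; run: pip install data-describe")
else:
    print(names)
```

From an interactive session, `help(dd)` after `import data_describe as dd` gives the same kind of overview along with the docstrings.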

Project Status

data-describe is currently in beta status.


Contributing

data-describe welcomes contributions from the community.
