Summary and Schedule
This is a new lesson built with The Carpentries Workbench.
| Setup Instructions | Download files required for the lesson | |
| Duration: 00h 00m | 1. Introduction to Data Visualization |
What exactly is data visualization and how does it differ from simple
charts? Why do humans understand visualized data much faster than raw numbers or tables? What are the real advantages and hidden pitfalls of using visuals in data analysis? How does visualization help (or complicate) working with big data? Which tools are most suitable for beginners, intermediate users, and advanced programmers? What makes a visualization “good” versus “misleading”? :::::::::::::::::::::::::::::::::::::::::::::::: |
| Duration: 01h 15m | Finish |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
Overview
To participate in the Data Visualization module, you will need:
- Python 3.10 or newer (3.11–3.12 recommended in local setups)
- The following Python packages:
- pandas (data handling)
- matplotlib (core plotting)
- seaborn (statistical visualizations)
- (optional but recommended) jupyterlab or notebook (interactive work)
- (optional) plotly (interactive charts in later episodes)
- A code editor or notebook interface
- Sample datasets (provided as a zip file)
We offer two main setup paths:
- Google Colab (recommended for beginners / no installation needed)
-
Local installation with
Anaconda Navigator(great for offline work and full control)
Option 1: Google Colab (Zero Installation – Recommended for Most Learners)
Google Colab is a free, cloud-based Jupyter notebook environment hosted by Google. It runs entirely in your browser, requires only a Google account, and comes with pandas, matplotlib, seaborn, and many other data science libraries pre-installed.
Steps
Sign in with your Google account (or create one if needed).
Click New notebook (or File → New notebook).
(Optional) Rename it: File → Rename (e.g., “Data Viz Workshop – Aniket”).
-
Test the libraries right away by running this in the first cell (Shift+Enter to execute):
PYTHON
import pandas as pd import matplotlib.pyplot as plt import seaborn as sns import plotly.express as px # optional – usually pre-installed too print("pandas version:", pd.__version__) print("matplotlib version:", plt.matplotlib.__version__) print("seaborn version:", sns.__version__) print("plotly version:", px.__version__ if 'px' in globals() else "not imported") # Quick test plot (should show inline) tips = sns.load_dataset("tips") # built-in Seaborn dataset sns.histplot(data=tips, x="total_bill", hue="time") plt.title("Test: Restaurant Tips Distribution") plt.show() Installing extra packages (rarely needed, but if something is missing or outdated):Python
(The ! runs shell commands in Colab/Jupyter
Notebook.)
Advantages of Colab for this workshop
- No software installation
- Free GPU/TPU if needed later
- Easy sharing (File → Share)
- Autosaves to Google Drive
- Perfect for following along with instructor demos
Tip: Upload your own data files via the left sidebar (Files tab → Upload) or mount Google Drive: Python
Option 2: Local Installation (Anaconda Navigator – For Offline / Advanced Use)
Use this if you prefer working without internet or need a persistent environment.
- Download and install Anaconda Navigator:
- https://www.anaconda.com/products/navigator
- Choose your OS installer (Python 3.x version) → follow defaults
- You should find multiple apps after installation.
- Launch
Jupyter Notebook - If you do not find a package simply add
!pipfollowed by the name of the package in code cell to install it locally.
Troubleshooting
- Colab: Plots not showing? Add %matplotlib inline at the top (usually automatic).
-
Local: package not found: Open terminal or code
cell in jupyter notebook and
!pipinstall package. - Need help? Raise hand during workshop or check Carpentries Python setup guide.
You’re all set! Proceed to Introduction to Data Visualization or Creating Your First Plots.
Happy visualizing!