Quick start guide¶
The 60-second version¶
Clone and install¶
git clone https://github.com/GlassAlpha/glassalpha
cd glassalpha/packages
# Install base framework (includes LogisticRegression baseline)
pip install -e .
# Check what models are available
glassalpha models
Generate an audit PDF (uses included German Credit example)¶
# Quick start with LogisticRegression (always available)
glassalpha audit --config configs/quickstart.yaml --output audit.pdf
# Or use the German Credit example (also uses LogisticRegression now)
glassalpha audit --config configs/german_credit_simple.yaml --output audit.pdf
That's it! You now have a complete audit report with model performance, SHAP explanations, and fairness metrics.
The 10-minute version¶
Get up and running with GlassAlpha in less than 10 minutes. This guide will take you from installation to generating your first professional audit PDF.
Prerequisites¶
- Python 3.11 or higher
- Git
- 2GB available disk space
- Command line access
Step 1: Installation¶
Clone and install¶
Clone and setup:
Python 3.11 or 3.12 recommended:
Create a virtual environment (recommended):
Install GlassAlpha:
python -m pip install --upgrade pip
# Option 1: Base install (LogisticRegression only, recommended for getting started)
pip install -e .
# Option 2: With advanced ML libraries (if you need XGBoost/LightGBM)
pip install -e ".[explain]" # SHAP + XGBoost + LightGBM
pip install -e ".[all]" # All features
# Option 3: Development install (includes testing tools)
pip install -e ".[dev]"
Verify installation:
You should see the CLI help message with available commands.
Step 2: Generate your first audit¶
GlassAlpha comes with a ready-to-use German Credit dataset example that demonstrates all core capabilities.
Run the audit command¶
Generate audit PDF (takes ~3 seconds):
What happens¶
- Automatic Dataset Resolution: Uses built-in German Credit dataset from registry
- Model Training: Trains XGBoost classifier with optimal parameters
- Explanations: Generates TreeSHAP feature importance analysis
- Fairness Analysis: Computes bias metrics for protected attributes (gender, age)
- PDF Generation: Creates professional audit report with visualizations
Expected output¶
Loading data and initializing components...
✓ Audit pipeline completed in 2.34s
📊 Audit Summary:
✅ Performance metrics: 6 computed
✅ accuracy: 73.5%
⚖️ Fairness metrics: 8/8 computed
✅ No bias detected
🔍 Explanations: ✅ Global feature importance
Most important: duration_months (+0.127)
📋 Dataset: 1,000 samples, 21 features
🔧 Components: 3 selected
Model: xgboost
Generating PDF report: my_first_audit.pdf
✓ Saved plot to /tmp/plots/shap_importance.png
✓ Saved plot to /tmp/plots/performance_summary.png
✓ Saved plot to /tmp/plots/fairness_analysis.png
🎉 Audit Report Generated Successfully!
==================================================
📁 Output: /path/to/my_first_audit.pdf
📊 Size: 847,329 bytes (827.5 KB)
⏱️ Total time: 3.12s
• Pipeline: 2.34s
• PDF generation: 0.78s
The audit report is ready for review and regulatory submission.
Step 3: Review your audit report¶
Open my_first_audit.pdf
to see your comprehensive audit report containing:
Executive summary¶
- Key findings and compliance status
- Model performance overview
- Bias detection results
- Regulatory assessment
Model performance analysis¶
- Accuracy, precision, recall, F1 score, AUC-ROC
- Confusion matrix
- Performance visualizations
SHAP explanations¶
- Global feature importance rankings
- Individual prediction explanations
- Waterfall plots showing decision factors
Fairness analysis¶
- Demographic parity assessment
- Equal opportunity analysis
- Bias detection across protected attributes
- Statistical significance testing
Reproducibility manifest¶
- Complete audit trail with timestamps
- Dataset fingerprints and model parameters
- Random seeds and component versions
- Git commit information
Step 4: Understanding the configuration¶
The configs/german_credit_simple.yaml
file contains all audit settings:
Audit profile determines component selection:
Reproducibility settings:
Data configuration:
data:
dataset: german_credit # Uses built-in German Credit dataset
fetch: if_missing # Automatically download if needed
target_column: credit_risk
protected_attributes:
- gender
- age_group
- foreign_worker
Model configuration:
Explainer selection:
explainers:
strategy: first_compatible
priority:
- treeshap # Primary choice for tree models
- kernelshap # Fallback for any model type
Metrics to compute:
metrics:
performance:
metrics: [accuracy, precision, recall, f1, auc_roc]
fairness:
metrics: [demographic_parity, equal_opportunity]
Next steps¶
Try advanced features¶
Enable strict mode for regulatory compliance:
glassalpha audit \
--config configs/german_credit_simple.yaml \
--output regulatory_audit.pdf \
--strict
Use a different model (edit config file: model.type: lightgbm):
Explore more options¶
See all available CLI options:
List available components:
Validate configuration without running audit:
Manage datasets:
glassalpha datasets list # See available datasets
glassalpha datasets info german_credit # Show dataset details
glassalpha datasets cache-dir # Show where datasets are cached
Work with your own data¶
- Prepare your data: CSV format with target column and features
- Create configuration: Copy and modify
german_credit_simple.yaml
- Run audit: Use your configuration file
See the Configuration Guide for detailed customization options.
Common use cases¶
Financial services compliance¶
- Credit scoring model validation
- Fair lending assessments
- Regulatory reporting (ECOA, FCRA)
- Model risk management
HR and employment¶
- Hiring algorithm audits
- Promotion decision analysis
- Salary equity assessments
- EEO compliance verification
Healthcare and insurance¶
- Risk assessment model validation
- Treatment recommendation audits
- Coverage decision analysis
- Health equity evaluations
Getting help¶
- Documentation: Complete Guide
- Configuration Reference: Configuration Guide
- Examples:
- German Credit Deep Dive - Complete audit walkthrough
- Healthcare Bias Detection - Medical AI compliance example
- Fraud Detection Audit - Financial services example
- Issues: GitHub Issues
Summary¶
You now have GlassAlpha installed and have generated your first audit report. The system provides:
- Production-ready audit generation in seconds
- Professional PDF reports suitable for regulatory review
- Comprehensive analysis covering performance, fairness, and explainability
- Full reproducibility with complete audit trails
- Flexible configuration for different use cases and models
GlassAlpha transforms complex ML audit requirements into a simple, reliable workflow that meets the highest professional and regulatory standards.