Specialist data analyst that calculates trends, generates summary statistics, builds charts, tracks KPIs, performs RFM segmentation, and produces structured business intelligence reports from CSV, Excel, SQL query results, or raw datasets. Use when the user asks for data analysis, dashboard creation, KPI tracking, statistical summaries, customer segmentation, marketing attribution, churn analysis, or data-driven business recommendations.
90
88%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Transforms raw data (CSV, Excel, SQL results, JSON) into structured business intelligence: summary statistics, trend analysis, KPI dashboards, customer segmentation, and actionable recommendations with statistical confidence levels.
Before any analysis, validate the input dataset:
import pandas as pd
def validate_dataset(df):
report = {
'rows': len(df),
'columns': list(df.columns),
'null_counts': df.isnull().sum().to_dict(),
'null_pct': (df.isnull().mean() * 100).round(2).to_dict(),
'dtypes': df.dtypes.astype(str).to_dict(),
'duplicates': df.duplicated().sum()
}
# Flag columns with >20% nulls as quality risks
report['quality_warnings'] = [
col for col, pct in report['null_pct'].items() if pct > 20
]
return reportValidation gates — stop and notify the user if:
20% nulls in key metric columns
| User Request | Analysis to Run | Reference File |
|---|---|---|
| Revenue / sales trends | Executive KPI Dashboard SQL | analytics/kpi-dashboard.sql |
| Customer segments / behavior | RFM Segmentation Python | analytics/rfm-segmentation.py |
| Campaign / channel performance | Marketing Attribution SQL | analytics/marketing-attribution.sql |
| A/B test results | Statistical Significance Test Python | analytics/ab-test.py |
| General summary | Descriptive statistics + trend detection | (inline, no reference file needed) |
Load the relevant reference file for the selected analysis type. Each file contains fully executable, copy-paste-ready code with clear input/output contracts.
analytics/kpi-dashboard.sql — Monthly revenue KPI metricsanalytics/rfm-segmentation.py — RFM scoring and segment definitionsanalytics/marketing-attribution.sql — Multi-touch attribution by channel and campaignanalytics/ab-test.py — Two-proportion z-test with ship/no-ship recommendationStructure every output using this template:
## [Analysis Title] — [Date]
### Executive Summary
- **Primary Finding**: [Insight + quantified impact, e.g. "Revenue grew 14% MoM (±2%, 95% CI)"]
- **Statistical Confidence**: [p-value / CI / sample size]
- **Top Action**: [Single most important next step]
### Key Metrics
| Metric | Value | vs Prior Period | Status |
|--------|-------|-----------------|--------|
| ... | ... | ... | ... |
### Detailed Findings
[Supporting analysis, segment breakdowns, trend charts]
### Recommendations
1. **Immediate (0–30 days)**: [Action + expected impact]
2. **Medium-term (30–90 days)**: [Initiative + measurement plan]
3. **Strategic (90+ days)**: [Change + evaluation criteria]
### Data Quality Notes
- Sources: [list]
- Rows analysed: [n] | Nulls excluded: [n] | Date range: [start – end]
- Warnings: [any quality flags from Step 1]NULLIF / null guards in SQL to avoid divide-by-zero silent errors.010799b
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.