Comprehensive toolkit for survival analysis and time-to-event modeling in Python using scikit-survival. Use this skill when working with censored survival data, performing time-to-event analysis, fitting Cox models, Random Survival Forests, Gradient Boosting models, or Survival SVMs, evaluating survival predictions with concordance index or Brier score, handling competing risks, or implementing any survival analysis workflow with the scikit-survival library.
90
82%
Does it follow best practices?
Impact
96%
1.24xAverage score across 6 eval scenarios
Passed
No known issues
Quality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that clearly defines its scope, provides comprehensive trigger terms covering the survival analysis domain, and explicitly states both what the skill does and when to use it. The description is well-structured with a concise summary followed by a detailed 'Use this skill when' clause listing specific scenarios. It occupies a clear niche that would be easily distinguishable from other data science or machine learning skills.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: survival analysis, time-to-event modeling, fitting Cox models, Random Survival Forests, Gradient Boosting models, Survival SVMs, evaluating with concordance index or Brier score, handling competing risks. | 3 / 3 |
Completeness | Clearly answers both what ('comprehensive toolkit for survival analysis and time-to-event modeling in Python using scikit-survival') and when ('Use this skill when working with censored survival data, performing time-to-event analysis, fitting Cox models...') with an explicit 'Use this skill when' clause. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms a user would say: 'survival analysis', 'time-to-event', 'censored survival data', 'Cox models', 'Random Survival Forests', 'concordance index', 'Brier score', 'competing risks', 'scikit-survival'. These are the exact terms practitioners use. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive niche — survival analysis with scikit-survival is a very specific domain. The mention of specific model types (Cox, RSF, Survival SVMs) and evaluation metrics (concordance index, Brier score) make it extremely unlikely to conflict with general ML or statistics skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
64%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured and highly actionable skill for scikit-survival with excellent executable code examples and a useful model selection decision tree. Its main weaknesses are verbosity (explaining survival analysis concepts Claude already knows, lengthy 'when to use' sections) and lack of validation/error-handling steps in workflows. The progressive disclosure structure is reasonable but the main file is heavy with content that could be delegated to reference files.
Suggestions
Remove the Overview paragraph explaining what survival analysis is and the 'When to Use This Skill' section — Claude knows these concepts and the skill description already covers this.
Add validation checkpoints to workflows: e.g., check y array structure after Surv creation, verify no negative times, confirm model convergence, and add error handling for common failures like singular matrices in Cox models.
Move 2-3 of the detailed workflow examples to a references/workflows.md file to reduce the main file's token footprint while keeping Workflow 1 as the primary inline example.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill contains some unnecessary explanations (e.g., 'Survival analysis aims to establish connections between covariates...', the 'When to Use This Skill' section listing obvious use cases, and the 'Overview' paragraph explaining what survival analysis is). The best practices and pitfalls sections are useful but somewhat verbose. However, the code examples and decision tree are efficient. | 2 / 3 |
Actionability | The skill provides fully executable, copy-paste ready code examples across multiple workflows (standard analysis, high-dimensional data, ensemble methods, model comparison). Key imports are consolidated, metrics usage is concrete with specific function calls, and the decision tree provides clear model selection guidance. | 3 / 3 |
Workflow Clarity | The four workflows are clearly sequenced with numbered steps and executable code. However, there are no validation checkpoints or error recovery steps — no checks for data quality issues, no verification that models converged, no feedback loops for when fitting fails or produces poor results. For survival analysis with potential data issues (negative times, insufficient events), this is a notable gap. | 2 / 3 |
Progressive Disclosure | The skill references six detailed reference files (cox-models.md, ensemble-models.md, etc.) with clear descriptions, which is good structure. However, no bundle files were provided, so these references are unverifiable. The main SKILL.md itself is quite long (~300+ lines) and includes substantial inline content that could have been pushed to references, particularly the four complete workflow examples and the comprehensive best practices/pitfalls lists. | 2 / 3 |
Total | 9 / 12 Passed |
Validation
90%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 10 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
metadata_version | 'metadata.version' is missing | Warning |
Total | 10 / 11 Passed | |
cbcae7b
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.