github.com/K-Dense-AI/claude-scientific-skills
Skill | Added | Review |
|---|---|---|
zarr-python Chunked N-D arrays for cloud storage. Compressed arrays, parallel I/O, S3/GCS integration, NumPy/Dask/Xarray compatible, for large-scale scientific computing pipelines. | 69 1.25x Agent success vs baseline Impact 90% 1.25xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
venue-templates Access comprehensive LaTeX templates, formatting requirements, and submission guidelines for major scientific publication venues (Nature, Science, PLOS, IEEE, ACM), academic conferences (NeurIPS, ICML, CVPR, CHI), research posters, and grant proposals (NSF, NIH, DOE, DARPA). This skill should be used when preparing manuscripts for journal submission, conference papers, research posters, or grant proposals and need venue-specific formatting requirements and templates. | 57 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
vaex Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that do not fit in memory. | 82 1.22x Agent success vs baseline Impact 72% 1.22xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
umap-learn UMAP dimensionality reduction. Fast nonlinear manifold learning for 2D/3D visualization, clustering preprocessing (HDBSCAN), supervised/parametric UMAP, for high-dimensional data. | 80 2.11x Agent success vs baseline Impact 89% 2.11xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
treatment-plans Generate concise (3-4 page), focused medical treatment plans in LaTeX/PDF format for all clinical specialties. Supports general medical treatment, rehabilitation therapy, mental health care, chronic disease management, perioperative care, and pain management. Includes SMART goal frameworks, evidence-based interventions with minimal text citations, regulatory compliance (HIPAA), and professional formatting. Prioritizes brevity and clinical actionability. | 65 1.44x Agent success vs baseline Impact 75% 1.44xAverage score across 6 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
transformers This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification, question answering, translation, summarization, image classification, object detection, speech recognition, and fine-tuning models on custom datasets. | 63 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
torchdrug PyTorch-native graph neural networks for molecules and proteins. Use when building custom GNN architectures for drug discovery, protein modeling, or knowledge graph reasoning. Best for custom model development, protein property prediction, retrosynthesis. For pre-trained models and diverse featurizers use deepchem; for benchmark datasets use pytdc. | 89 1.18x Agent success vs baseline Impact 94% 1.18xAverage score across 9 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
sympy Use this skill when working with symbolic mathematics in Python. This skill should be used for symbolic computation tasks including solving equations algebraically, performing calculus operations (derivatives, integrals, limits), manipulating algebraic expressions, working with matrices symbolically, physics calculations, number theory problems, geometry computations, and generating executable code from mathematical expressions. Apply this skill when the user needs exact symbolic results rather than numerical approximations, or when working with mathematical formulas that contain variables and parameters. | 91 1.12x Agent success vs baseline Impact 89% 1.12xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
statsmodels Statistical models library for Python. Use when you need specific model classes (OLS, GLM, mixed models, ARIMA) with detailed diagnostics, residuals, and inference. Best for econometrics, time series, rigorous inference with coefficient tables. For guided statistical test selection with APA reporting use statistical-analysis. | 85 1.09x Agent success vs baseline Impact 93% 1.09xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
statistical-analysis Guided statistical analysis with test selection and reporting. Use when you need help choosing appropriate tests for your data, assumption checking, power analysis, and APA-formatted results. Best for academic research reporting, test selection guidance. For implementing specific models programmatically use statsmodels. | 84 1.13x Agent success vs baseline Impact 91% 1.13xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
stable-baselines3 Production-ready reinforcement learning algorithms (PPO, SAC, DQN, TD3, DDPG, A2C) with scikit-learn-like API. Use for standard RL experiments, quick prototyping, and well-documented algorithm implementations. Best for single-agent RL with Gymnasium environments. For high-performance parallel training, multi-agent systems, or custom vectorized environments, use pufferlib instead. | 91 1.07x Agent success vs baseline Impact 95% 1.07xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
simpy Process-based discrete-event simulation framework in Python. Use this skill when building simulations of systems with processes, queues, resources, and time-based events such as manufacturing systems, service operations, network traffic, logistics, or any system where entities interact with shared resources over time. | 88 1.73x Agent success vs baseline Impact 92% 1.73xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
shap Model interpretability and explainability using SHAP (SHapley Additive exPlanations). Use this skill when explaining machine learning model predictions, computing feature importance, generating SHAP plots (waterfall, beeswarm, bar, scatter, force, heatmap), debugging models, analyzing model bias or fairness, comparing models, or implementing explainable AI. Works with tree-based models (XGBoost, LightGBM, Random Forest), deep learning (TensorFlow, PyTorch), linear models, and any black-box model. | 83 1.29x Agent success vs baseline Impact 80% 1.29xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
seaborn Statistical visualization with pandas integration. Use for quick exploration of distributions, relationships, and categorical comparisons with attractive defaults. Best for box plots, violin plots, pair plots, heatmaps. Built on matplotlib. For interactive plots use plotly; for publication styling use scientific-visualization. | 84 2.81x Agent success vs baseline Impact 90% 2.81xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scvi-tools Deep generative models for single-cell omics. Use when you need probabilistic batch correction (scVI), transfer learning, differential expression with uncertainty, or multi-modal integration (TOTALVI, MultiVI). Best for advanced modeling, batch effects, multimodal data. For standard analysis pipelines use scanpy. | 92 1.14x Agent success vs baseline Impact 97% 1.14xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scikit-survival Comprehensive toolkit for survival analysis and time-to-event modeling in Python using scikit-survival. Use this skill when working with censored survival data, performing time-to-event analysis, fitting Cox models, Random Survival Forests, Gradient Boosting models, or Survival SVMs, evaluating survival predictions with concordance index or Brier score, handling competing risks, or implementing any survival analysis workflow with the scikit-survival library. | 92 1.24x Agent success vs baseline Impact 96% 1.24xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scikit-learn Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model evaluation, hyperparameter tuning, preprocessing, or building ML pipelines. Provides comprehensive reference documentation for algorithms, preprocessing techniques, pipelines, and best practices. | 81 0.96x Agent success vs baseline Impact 93% 0.96xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scikit-bio Biological data toolkit. Sequence analysis, alignments, phylogenetic trees, diversity metrics (alpha/beta, UniFrac), ordination (PCoA), PERMANOVA, FASTA/Newick I/O, for microbiome analysis. | 72 2.21x Agent success vs baseline Impact 73% 2.21xAverage score across 6 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scientific-writing Core skill for the deep research and writing tool. Write scientific manuscripts in full paragraphs (never bullet points). Use two-stage process with (1) section outlines with key points using research-lookup then (2) convert to flowing prose. IMRAD structure, citations (APA/AMA/Vancouver), figures/tables, reporting guidelines (CONSORT/STROBE/PRISMA), for research papers and journal submissions. | 70 1.78x Agent success vs baseline Impact 93% 1.78xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
scientific-visualization Meta-skill for publication-ready figures. Use when creating journal submission figures requiring multi-panel layouts, significance annotations, error bars, colorblind-safe palettes, and specific journal formatting (Nature, Science, Cell). Orchestrates matplotlib/seaborn/plotly with publication styles. For quick exploration use seaborn or plotly directly. | 76 1.13x Agent success vs baseline Impact 94% 1.13xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scientific-slides Build slide decks and presentations for research talks. Use this for making PowerPoint slides, conference presentations, seminar talks, research presentations, thesis defense slides, or any scientific talk. Provides slide structure, design templates, timing guidance, and visual validation. Works with PowerPoint and LaTeX Beamer. | 75 4.45x Agent success vs baseline Impact 89% 4.45xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
scientific-schematics Create publication-quality scientific diagrams using Nano Banana 2 AI with smart iterative refinement. Uses Gemini 3.1 Pro Preview for quality review. Only regenerates if quality is below threshold for your document type. Specialized in neural network architectures, system diagrams, flowcharts, biological pathways, and complex scientific visualizations. | 68 2.87x Agent success vs baseline Impact 95% 2.87xAverage score across 3 eval scenarios Securityby Risky Do not use without reviewing Reviewed: Version: cbcae7b | |
scientific-critical-thinking Evaluate scientific claims and evidence quality. Use for assessing experimental design validity, identifying biases and confounders, applying evidence grading frameworks (GRADE, Cochrane Risk of Bias), or teaching critical analysis. Best for understanding evidence quality, identifying flaws. For formal peer review writing use peer-review. | 79 1.02x Agent success vs baseline Impact 96% 1.02xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
scientific-brainstorming Creative research ideation and exploration. Use for open-ended brainstorming sessions, exploring interdisciplinary connections, challenging assumptions, or identifying research gaps. Best for early-stage research planning when you do not have specific observations yet. For formulating testable hypotheses from data use hypothesis-generation. | 84 1.02x Agent success vs baseline Impact 89% 1.02xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
scholar-evaluation Systematically evaluate scholarly work using the ScholarEval framework, providing structured assessment across research quality dimensions including problem formulation, methodology, analysis, and writing with quantitative scoring and actionable feedback. | 55 1.67x Agent success vs baseline Impact 92% 1.67xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b |