github.com/K-Dense-AI/claude-scientific-skills
Skill | Added | Review |
|---|---|---|
neurokit2 Comprehensive biosignal processing toolkit for analyzing physiological data including ECG, EEG, EDA, RSP, PPG, EMG, and EOG signals. Use this skill when processing cardiovascular signals, brain activity, electrodermal responses, respiratory patterns, muscle activity, or eye movements. Applicable for heart rate variability analysis, event-related potentials, complexity measures, autonomic nervous system assessment, psychophysiology research, and multi-modal physiological signal integration. | 90 1.80x Agent success vs baseline Impact 99% 1.80xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
networkx Comprehensive toolkit for creating, analyzing, and visualizing complex networks and graphs in Python. Use when working with network/graph data structures, analyzing relationships between entities, computing graph algorithms (shortest paths, centrality, clustering), detecting communities, generating synthetic networks, or visualizing network topologies. Applicable to social networks, biological networks, transportation systems, citation networks, and any domain involving pairwise relationships. | 79 1.26x Agent success vs baseline Impact 81% 1.26xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
molfeat Molecular featurization for ML (100+ featurizers). ECFP, MACCS, descriptors, pretrained models (ChemBERTa), convert SMILES to features, for QSAR and molecular ML. | 72 1.41x Agent success vs baseline Impact 78% 1.41xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
modal Cloud computing platform for running Python on GPUs and serverless infrastructure. Use when deploying AI/ML models, running GPU-accelerated workloads, serving web endpoints, scheduling batch jobs, or scaling Python code to the cloud. Use this skill whenever the user mentions Modal, serverless GPU compute, deploying ML models to the cloud, serving inference endpoints, running batch processing in the cloud, or needs to scale Python workloads beyond their local machine. Also use when the user wants to run code on H100s, A100s, or other cloud GPUs, or needs to create a web API for a model. | 70 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
medchem Medicinal chemistry filters. Apply drug-likeness rules (Lipinski, Veber), PAINS filters, structural alerts, complexity metrics, for compound prioritization and library filtering. | 65 3.50x Agent success vs baseline Impact 63% 3.50xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
matlab MATLAB and GNU Octave numerical computing for matrix operations, data analysis, visualization, and scientific computing. Use when writing MATLAB/Octave scripts for linear algebra, signal processing, image processing, differential equations, optimization, statistics, or creating scientific visualizations. Also use when the user needs help with MATLAB syntax, functions, or wants to convert between MATLAB and Python code. Scripts can be executed with MATLAB or the open-source GNU Octave interpreter. | 63 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
matchms Spectral similarity and compound identification for metabolomics. Use for comparing mass spectra, computing similarity scores (cosine, modified cosine), and identifying unknown compounds from spectral libraries. Best for metabolite identification, spectral matching, library searching. For full LC-MS/MS proteomics pipelines use pyopenms. | 88 4.63x Agent success vs baseline Impact 88% 4.63xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
market-research-reports Generate comprehensive market research reports (50+ pages) in the style of top consulting firms (McKinsey, BCG, Gartner). Features professional LaTeX formatting, extensive visual generation with scientific-schematics and generate-image, deep integration with research-lookup for data gathering, and multi-framework strategic analysis including Porter Five Forces, PESTLE, SWOT, TAM/SAM/SOM, and BCG Matrix. | 71 2.00x Agent success vs baseline Impact 96% 2.00xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
literature-review Conduct comprehensive, systematic literature reviews using multiple academic databases (PubMed, arXiv, bioRxiv, Semantic Scholar, etc.). This skill should be used when conducting systematic literature reviews, meta-analyses, research synthesis, or comprehensive literature searches across biomedical, scientific, and technical domains. Creates professionally formatted markdown documents and PDFs with verified citations in multiple citation styles (APA, Nature, Vancouver, etc.). | 57 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
latex-posters Create professional research posters in LaTeX using beamerposter, tikzposter, or baposter. Support for conference presentations, academic posters, and scientific communication. Includes layout design, color schemes, multi-column formats, figure integration, and poster-specific best practices for visual communication. | 52 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
latchbio-integration Latch platform for bioinformatics workflows. Build pipelines with Latch SDK, @workflow/@task decorators, deploy serverless workflows, LatchFile/LatchDir, Nextflow/Snakemake integration. | 42 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
lamindb This skill should be used when working with LaminDB, an open-source data framework for biology that makes data queryable, traceable, reproducible, and FAIR. Use when managing biological datasets (scRNA-seq, spatial, flow cytometry, etc.), tracking computational workflows, curating and validating data with biological ontologies, building data lakehouses, or ensuring data lineage and reproducibility in biological research. Covers data management, annotation, ontologies (genes, cell types, diseases, tissues), schema validation, integrations with workflow managers (Nextflow, Snakemake) and MLOps platforms (W&B, MLflow), and deployment strategies. | 85 1.32x Agent success vs baseline Impact 74% 1.32xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
iso-13485-certification Comprehensive toolkit for preparing ISO 13485 certification documentation for medical device Quality Management Systems. Use when users need help with ISO 13485 QMS documentation, including (1) conducting gap analysis of existing documentation, (2) creating Quality Manuals, (3) developing required procedures and work instructions, (4) preparing Medical Device Files, (5) understanding ISO 13485 requirements, or (6) identifying missing documentation for medical device certification. Also use when users mention medical device regulations, QMS certification, FDA QMSR, EU MDR, or need help with quality system documentation. | 76 1.45x Agent success vs baseline Impact 93% 1.45xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
imaging-data-commons Query and download public cancer imaging data from NCI Imaging Data Commons using idc-index. Use for accessing large-scale radiology (CT, MR, PET) and pathology datasets for AI training or research. No authentication required. Query by metadata, visualize in browser, check licenses. | 74 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
hypothesis-generation Structured hypothesis formulation from observations. Use when you have experimental observations or data and need to formulate testable hypotheses with predictions, propose mechanisms, and design experiments to test them. Follows scientific method framework. For open-ended ideation use scientific-brainstorming; for automated LLM-driven hypothesis testing on datasets use hypogenic. | 78 1.67x Agent success vs baseline Impact 99% 1.67xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
hypogenic Automated LLM-driven hypothesis generation and testing on tabular datasets. Use when you want to systematically explore hypotheses about patterns in empirical data (e.g., deception detection, content analysis). Combines literature insights with data-driven hypothesis testing. For manual hypothesis formulation use hypothesis-generation; for creative ideation use scientific-brainstorming. | 54 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
histolab Lightweight WSI tile extraction and preprocessing. Use for basic slide processing tissue detection, tile extraction, stain normalization for H&E images. Best for simple pipelines, dataset preparation, quick tile-based analysis. For advanced spatial proteomics, multiplexed imaging, or deep learning pipelines use pathml. | 62 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
gtars High-performance toolkit for genomic interval analysis in Rust with Python bindings. Use when working with genomic regions, BED files, coverage tracks, overlap detection, tokenization for ML models, or fragment analysis in computational genomics and machine learning applications. | 55 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
gget Fast CLI/Python queries to 20+ bioinformatics databases. Use for quick lookups: gene info, BLAST searches, AlphaFold structures, enrichment analysis. Best for interactive exploration, simple queries. For batch processing or advanced BLAST use biopython; for multi-database Python workflows use bioservices. | 79 1.59x Agent success vs baseline Impact 99% 1.59xAverage score across 3 eval scenarios Securityby Risky Do not use without reviewing Reviewed: Version: cbcae7b | |
get-available-resources This skill should be used at the start of any computationally intensive scientific task to detect and report available system resources (CPU cores, GPUs, memory, disk space). It creates a JSON file with resource information and strategic recommendations that inform computational approach decisions such as whether to use parallel processing (joblib, multiprocessing), out-of-core computing (Dask, Zarr), GPU acceleration (PyTorch, JAX), or memory-efficient strategies. Use this skill before running analyses, training models, processing large datasets, or any task where resource constraints matter. | 74 2.84x Agent success vs baseline Impact 91% 2.84xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
geopandas Python library for working with geospatial vector data including shapefiles, GeoJSON, and GeoPackage files. Use when working with geographic data for spatial analysis, geometric operations, coordinate transformations, spatial joins, overlay operations, choropleth mapping, or any task involving reading/writing/analyzing vector geographic data. Supports PostGIS databases, interactive maps, and integration with matplotlib/folium/cartopy. Use for tasks like buffer analysis, spatial joins between datasets, dissolving boundaries, clipping data, calculating areas/distances, reprojecting coordinate systems, creating maps, or converting between spatial file formats. | 92 1.35x Agent success vs baseline Impact 87% 1.35xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
geniml This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning. | 58 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: cbcae7b | |
generate-image Generate or edit images using AI models (FLUX, Nano Banana 2). Use for general-purpose image generation including photos, illustrations, artwork, visual assets, concept art, and any image that is not a technical diagram or schematic. For flowcharts, circuits, pathways, and technical diagrams, use the scientific-schematics skill instead. | 86 1.78x Agent success vs baseline Impact 82% 1.78xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b | |
fluidsim Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis. | 68 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: cbcae7b | |
flowio Parse FCS (Flow Cytometry Standard) files v2.0-3.1. Extract events as NumPy arrays, read metadata/channels, convert to CSV/DataFrame, for flow cytometry data preprocessing. | 86 2.27x Agent success vs baseline Impact 91% 2.27xAverage score across 3 eval scenarios Securityby Passed No known issues Reviewed: Version: cbcae7b |