CtrlK
BlogDocsLog inGet started
Tessl Logo

microbiome-diversity-reporter

Interpret Alpha and Beta diversity metrics from 16S rRNA sequencing results and generate visualization reports for microbiome analysis.

Install with Tessl CLI

npx tessl i github:aipoch/medical-research-skills --skill microbiome-diversity-reporter
What are skills?

78

Does it follow best practices?

Validation for skill structure

SKILL.md
Review
Evals

Microbiome Diversity Reporter


Overview

This tool is used to analyze and interpret diversity metrics in microbiome 16S rRNA sequencing data, including:

  • Alpha Diversity: Species diversity within a single sample
  • Beta Diversity: Species composition differences between samples

Usage

Command Line

# Analyze Alpha diversity for a single sample
python scripts/main.py --input otu_table.tsv --metric shannon --output alpha_report.html

# Analyze Beta diversity (PCoA)
python scripts/main.py --input otu_table.tsv --beta --metadata metadata.tsv --output beta_report.html

# Generate full report (Alpha + Beta)
python scripts/main.py --input otu_table.tsv --full --metadata metadata.tsv --output diversity_report.html

Parameter Description

ParameterDescriptionRequired
--inputOTU/ASV table path (TSV format)Yes
--metadataSample metadata (TSV format)Required for Beta diversity
--metricAlpha diversity metric: shannon, simpson, chao1, observed_otusNo (default: shannon)
--alphaCalculate Alpha diversity onlyNo
--betaCalculate Beta diversity onlyNo
--fullGenerate full report (Alpha + Beta)No
--outputOutput report pathNo (default: stdout)
--formatOutput format: html, json, markdownNo (default: html)

Input Format

OTU Table (TSV)

#OTU ID	Sample1	Sample2	Sample3
OTU_1	100	50	200
OTU_2	50	100	0
OTU_3	25	25	50

Metadata (TSV)

SampleID	Group	Age	Gender
Sample1	Control	25	M
Sample2	Treatment	30	F
Sample3	Treatment	28	M

Output

Generates HTML/JSON/Markdown reports containing:

  1. Alpha Diversity Results

    • Diversity index values
    • Rarefaction curves
    • Box plots (by group)
  2. Beta Diversity Results

    • PCoA scatter plots
    • NMDS plots
    • Distance matrix heatmaps
    • PERMANOVA statistical tests
  3. Statistical Summary

    • Sample information statistics
    • Species richness
    • Diversity index distribution

Dependencies

  • Python 3.8+
  • numpy
  • pandas
  • scipy
  • scikit-bio
  • matplotlib
  • seaborn
  • plotly (for interactive charts)

Example Output

{
  "alpha_diversity": {
    "shannon": {
      "Sample1": 2.45,
      "Sample2": 1.89,
      "Sample3": 2.12
    },
    "statistics": {
      "mean": 2.15,
      "std": 0.28
    }
  },
  "beta_diversity": {
    "method": "braycurtis",
    "pcoa": {
      "variance_explained": [0.45, 0.25, 0.15]
    }
  }
}

References

  1. Shannon, C.E. (1948) A mathematical theory of communication
  2. Simpson, E.H. (1949) Measurement of diversity
  3. Chao, A. (1984) Non-parametric estimation of classes
  4. Lozupone et al. (2005) UniFrac: a phylogenetic metric

Risk Assessment

Risk IndicatorAssessmentLevel
Code ExecutionPython/R scripts executed locallyMedium
Network AccessNo external API callsLow
File System AccessRead input files, write output filesMedium
Instruction TamperingStandard prompt guidelinesLow
Data ExposureOutput files saved to workspaceLow

Security Checklist

  • No hardcoded credentials or API keys
  • No unauthorized file system access (../)
  • Output does not expose sensitive information
  • Prompt injection protections in place
  • Input file paths validated (no ../ traversal)
  • Output directory restricted to workspace
  • Script execution in sandboxed environment
  • Error messages sanitized (no stack traces exposed)
  • Dependencies audited

Prerequisites

# Python dependencies
pip install -r requirements.txt

Evaluation Criteria

Success Metrics

  • Successfully executes main functionality
  • Output meets quality standards
  • Handles edge cases gracefully
  • Performance is acceptable

Test Cases

  1. Basic Functionality: Standard input → Expected output
  2. Edge Case: Invalid input → Graceful error handling
  3. Performance: Large dataset → Acceptable processing time

Lifecycle Status

  • Current Stage: Draft
  • Next Review Date: 2026-03-06
  • Known Issues: None
  • Planned Improvements:
    • Performance optimization
    • Additional feature support
Repository
aipoch/medical-research-skills
Last updated
Created

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.