A clinical-grade PII/PHI detection and de-identification tool for healthcare text data.
41
27%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./scientific-skills/Academic Writing/hipaa-compliance-auditor/SKILL.mdA clinical-grade PII/PHI detection and de-identification tool for healthcare text data.
See ## Features above for related details.
scripts/main.py.references/ for task-specific guidance.See references/requirements.txt for full dependency list.
See ## Usage above for related details.
cd "20260318/scientific-skills/Academic Writing/hipaa-compliance-auditor"
python -m py_compile scripts/main.py
python scripts/main.py --helpExample run plan:
CONFIG block or documented parameters if the script uses fixed settings.python scripts/main.py with the validated inputs.See ## Workflow above for related details.
scripts/main.py.references/ contains supporting rules, prompts, or checklists.Use this command to verify that the packaged script entry point can be parsed before deeper execution.
python -m py_compile scripts/main.pyUse these concrete commands for validation. They are intentionally self-contained and avoid placeholder paths.
python -m py_compile scripts/main.py
python scripts/main.py --help
python scripts/main.py --text "Audit validation sample with explicit methods, findings, and conclusion."This skill analyzes text for HIPAA-protected identifiers and automatically redacts or anonymizes them. It uses a combination of regex patterns, NLP entity recognition, and contextual analysis to identify 18 HIPAA identifier categories.
[PATIENT_NAME], [DATE_1])python scripts/main.py --input "patient_text.txt" --output "deidentified.txt"
python scripts/main.py --text "Patient John Doe, SSN 123-45-6789..." --audit-log audit.jsonfrom scripts.main import HIPAAAuditor
auditor = HIPAAAuditor()
result = auditor.deidentify("Patient John Doe was admitted on 2024-01-15...")
print(result.cleaned_text) # De-identified output
print(result.detected_pii) # List of found PII entities| Parameter | Type | Default | Required | Description |
|---|---|---|---|---|
--input, -i | string | - | No | Path to input text file |
--text | string | - | No | Direct text input (alternative to file) |
--output, -o | string | - | No | Path for de-identified output file |
--audit-log | string | - | No | Path for JSON audit log |
--confidence | float | 0.7 | No | Minimum confidence threshold (0.0-1.0) |
--preserve-structure | bool | true | No | Maintain document structure |
--custom-patterns | string | - | No | Path to custom regex patterns JSON |
Original identifiers replaced with semantic tags:
[PATIENT_NAME_1], [PATIENT_NAME_2] ...[DATE_1], [DATE_2] ...[SSN_1][PHONE_1], [PHONE_2] ...[EMAIL_1][MRN_1] (Medical Record Number)[ADDRESS_1]{
"timestamp": "2024-01-15T10:30:00Z",
"input_hash": "sha256:abc123...",
"detections": [
{
"type": "PATIENT_NAME",
"position": [10, 18],
"confidence": 0.95,
"replacement": "[PATIENT_NAME_1]",
"original_length": 8
}
],
"statistics": {
"total_pii_found": 5,
"categories_detected": ["NAME", "DATE", "PHONE", "SSN"]
}
}⚠️ CRITICAL: This tool is designed as a helper, not a replacement for human review.
references/hipaa_safe_harbor_guide.pdf - HIPAA Safe Harbor de-identification standardsreferences/pii_patterns.json - Complete regex pattern definitionsreferences/test_cases/ - Sample clinical texts with expected outputsreferences/requirements.txt - Python dependenciesComplex NLP pipelines, contextual disambiguation, regulatory compliance requirements.
| Risk Indicator | Assessment | Level |
|---|---|---|
| Code Execution | Python/R scripts executed locally | Medium |
| Network Access | No external API calls | Low |
| File System Access | Read input files, write output files | Medium |
| Instruction Tampering | Standard prompt guidelines | Low |
| Data Exposure | Output files saved to workspace | Low |
# Python dependencies
pip install -r requirements.txtEvery final response should make these items explicit when they are relevant:
scripts/main.py fails, report the failure point, summarize what still can be completed safely, and provide a manual fallback.This skill accepts requests that match the documented purpose of hipaa-compliance-auditor and include enough context to complete the workflow safely.
Do not continue the workflow when the request is out of scope, missing a critical input, or would require unsupported assumptions. Instead respond:
hipaa-compliance-auditoronly handles its documented workflow. Please provide the missing required inputs or switch to a more suitable skill.
Use the following fixed structure for non-trivial requests:
If the request is simple, you may compress the structure, but still keep assumptions and limits explicit when they affect correctness.
8277276
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.