CtrlK
BlogDocsLog inGet started
Tessl Logo

langfuse-ci-integration

Configure Langfuse CI/CD integration with GitHub Actions and automated testing. Use when setting up automated testing, configuring CI pipelines, or integrating Langfuse tests into your build process. Trigger with phrases like "langfuse CI", "langfuse GitHub Actions", "langfuse automated tests", "CI langfuse", "langfuse pipeline".

80

Quality

77%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/saas-packs/langfuse-pack/skills/langfuse-ci-integration/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid skill description with excellent completeness and distinctiveness. It clearly specifies when to use the skill and provides explicit trigger phrases. The main weakness is that the 'what' portion could be more specific about the concrete actions performed beyond just 'configure'. Note: the description uses second person ('your build process') which is a minor style issue.

Suggestions

Add more specific concrete actions, e.g., 'Creates GitHub Actions workflow files, configures environment variables and secrets, sets up Langfuse test runners, and validates pipeline output.'

DimensionReasoningScore

Specificity

Names the domain (Langfuse CI/CD with GitHub Actions) and mentions 'automated testing' and 'integration', but doesn't list multiple concrete actions beyond 'configure' — lacks specifics like 'create workflow files', 'set up test runners', 'configure environment secrets', etc.

2 / 3

Completeness

Clearly answers both 'what' (configure Langfuse CI/CD integration with GitHub Actions and automated testing) and 'when' (setting up automated testing, configuring CI pipelines, integrating Langfuse tests into build process) with explicit 'Use when' and 'Trigger with' clauses.

3 / 3

Trigger Term Quality

Includes a good set of natural trigger phrases: 'langfuse CI', 'langfuse GitHub Actions', 'langfuse automated tests', 'CI langfuse', 'langfuse pipeline'. These are terms users would naturally say, and the explicit trigger list covers common variations well.

3 / 3

Distinctiveness Conflict Risk

Highly specific niche combining Langfuse + CI/CD + GitHub Actions. The trigger terms are all Langfuse-prefixed, making it very unlikely to conflict with generic CI/CD skills or other Langfuse skills focused on different aspects.

3 / 3

Total

11

/

12

Passed

Implementation

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a highly actionable skill with complete, executable code examples covering multiple CI/CD integration scenarios for Langfuse. Its main weaknesses are length (could split detailed examples into referenced files) and missing validation/feedback loops between workflow steps, particularly around prompt deployment. The error handling and best practices tables are well-done and add value efficiently.

Suggestions

Add explicit validation checkpoints between steps, especially a verification step after prompt deployment (Step 4) to confirm prompts were deployed correctly before relying on them.

Consider splitting the detailed test examples (Steps 2-3) and deployment scripts (Steps 4-5) into separate referenced files, keeping SKILL.md as a concise overview with quick-start guidance and links to detailed examples.

DimensionReasoningScore

Conciseness

The skill is fairly efficient but quite long. The code examples are thorough and mostly earn their place, but the overall volume is high — some examples (like the connectivity check step, or the full experiment test) could be trimmed. The best practices and error handling tables are concise and useful.

2 / 3

Actionability

Fully executable code examples throughout: complete GitHub Actions YAML workflows, TypeScript test files, deployment scripts, and monitoring scripts. All are copy-paste ready with specific file paths, environment variables, and concrete assertions.

3 / 3

Workflow Clarity

Steps are clearly numbered and sequenced, but there are no explicit validation checkpoints or feedback loops between steps. For example, Step 4 (prompt deployment) has no validation that the deployment succeeded before proceeding, and there's no guidance on what to do if tests fail in Step 1 before moving to deployment. The destructive nature of prompt deployment to production warrants validation steps.

2 / 3

Progressive Disclosure

The content is well-structured with clear headings and a logical progression, and it links to external Langfuse docs at the end. However, the skill is quite long (~200+ lines of code) and could benefit from splitting detailed test examples into separate reference files, keeping SKILL.md as a concise overview with pointers.

2 / 3

Total

9

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Repository
jeremylongshore/claude-code-plugins-plus-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.