CtrlK
BlogDocsLog inGet started
Tessl Logo

langfuse-ci-integration

Configure Langfuse CI/CD integration with GitHub Actions and automated testing. Use when setting up automated testing, configuring CI pipelines, or integrating Langfuse tests into your build process. Trigger with phrases like "langfuse CI", "langfuse GitHub Actions", "langfuse automated tests", "CI langfuse", "langfuse pipeline".

64

Quality

77%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/saas-packs/langfuse-pack/skills/langfuse-ci-integration/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid description with excellent trigger terms and completeness, clearly specifying both when to use it and what it does. Its main weakness is that the 'what' portion could be more specific about the concrete actions performed (e.g., creating workflow YAML files, configuring secrets, setting up test commands). The description also uses second person ('your build process') which is a minor style issue.

Suggestions

Add more specific concrete actions like 'Creates GitHub Actions workflow files, configures environment secrets, sets up test execution steps' to improve specificity.

Replace second person 'your build process' with third person phrasing like 'the build process' to match the preferred voice.

DimensionReasoningScore

Specificity

Names the domain (Langfuse CI/CD with GitHub Actions) and mentions 'automated testing' and 'integration', but doesn't list multiple concrete actions beyond 'configure' — lacks specifics like 'create workflow files', 'set up test runners', 'configure environment secrets', etc.

2 / 3

Completeness

Clearly answers both 'what' (configure Langfuse CI/CD integration with GitHub Actions and automated testing) and 'when' (explicit 'Use when' clause plus 'Trigger with phrases like' section providing concrete trigger terms).

3 / 3

Trigger Term Quality

Includes a good set of natural trigger phrases: 'langfuse CI', 'langfuse GitHub Actions', 'langfuse automated tests', 'CI langfuse', 'langfuse pipeline'. Also includes broader terms like 'automated testing', 'CI pipelines', and 'build process' that users would naturally say.

3 / 3

Distinctiveness Conflict Risk

The combination of 'Langfuse' + 'CI/CD' + 'GitHub Actions' creates a very specific niche. The trigger terms are all Langfuse-prefixed, making it unlikely to conflict with generic CI/CD skills or other Langfuse skills focused on different aspects.

3 / 3

Total

11

/

12

Passed

Implementation

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a highly actionable skill with complete, executable code examples covering the full CI/CD integration lifecycle for Langfuse. Its main weaknesses are length (could benefit from splitting detailed code into bundle files) and missing explicit validation/feedback loops between pipeline stages. The error handling table is a nice touch but the overall document would benefit from tighter organization.

Suggestions

Add explicit validation checkpoints between steps—e.g., verify prompt deployment succeeded before running regression tests, and define what happens when quality gates fail (block deploy, alert, etc.).

Split the detailed TypeScript test files and deployment scripts into separate bundle files, keeping SKILL.md as a concise overview with references to each file.

DimensionReasoningScore

Conciseness

The skill is fairly long with extensive code examples across 5 steps. Some content could be tightened—the connectivity check step is somewhat unnecessary, and the best practices/error handling tables, while useful, add bulk. However, most code examples earn their place as they demonstrate non-obvious integration patterns.

2 / 3

Actionability

Every step provides fully executable, copy-paste ready code—complete GitHub Actions YAML workflows, TypeScript test files, and deployment scripts with real imports, API calls, and assertions. The examples are concrete and specific with realistic test scenarios.

3 / 3

Workflow Clarity

The 5 steps are clearly sequenced and logically ordered (setup → test → gate → deploy → monitor). However, there are no explicit validation checkpoints or feedback loops between steps—e.g., no guidance on what to do if the quality gate fails before deployment, or verification that prompt deployment succeeded before running regression tests.

2 / 3

Progressive Disclosure

The skill has reasonable section structure and links to external Langfuse docs, but all content is inline in a single long file. The detailed test files and deployment scripts could be split into referenced bundle files, with SKILL.md serving as a concise overview pointing to them.

2 / 3

Total

9

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Repository
jeremylongshore/claude-code-plugins-plus-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.