CtrlK
BlogDocsLog inGet started
Tessl Logo

langfuse-ci-integration

Configure Langfuse CI/CD integration with GitHub Actions and automated testing. Use when setting up automated testing, configuring CI pipelines, or integrating Langfuse tests into your build process. Trigger with phrases like "langfuse CI", "langfuse GitHub Actions", "langfuse automated tests", "CI langfuse", "langfuse pipeline".

64

Quality

77%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/saas-packs/langfuse-pack/skills/langfuse-ci-integration/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill provides highly actionable, executable guidance for Langfuse CI/CD integration with complete code examples across multiple use cases. Its main weakness is its length—it's a monolithic document that could benefit from splitting detailed steps into separate files, and it lacks explicit validation checkpoints between workflow steps. The error handling and best practices tables add practical value but contribute to the overall verbosity.

Suggestions

Split Steps 2-5 into separate referenced files (e.g., PROMPT-REGRESSION.md, EXPERIMENT-GATES.md) and keep SKILL.md as a concise overview with quick-start guidance and navigation links.

Add explicit validation checkpoints between steps, such as 'Verify traces appear in Langfuse dashboard before proceeding to Step 3' and a rollback/retry flow for the prompt deployment step.

Add an `afterAll` flush/shutdown call in the test examples (referenced in the best practices table but missing from the actual code), since this is flagged as a common CI issue.

DimensionReasoningScore

Conciseness

The skill is fairly long with extensive code examples across 5 steps. Some content could be tightened—the connectivity check step is somewhat unnecessary, and the best practices/error handling tables, while useful, add bulk. However, most content is novel domain-specific guidance that Claude wouldn't inherently know.

2 / 3

Actionability

Every step provides fully executable, copy-paste ready code—complete GitHub Actions YAML workflows, TypeScript test files, and deployment scripts with real imports, API calls, and assertions. The examples are concrete and specific.

3 / 3

Workflow Clarity

The steps are clearly sequenced (1-5) and cover a logical progression from CI setup to monitoring. However, there are no explicit validation checkpoints or feedback loops between steps—e.g., no 'verify traces appear before proceeding' step, no error recovery guidance within the workflow itself. The prompt deployment step (Step 4) is a destructive/batch operation with no validation gate.

2 / 3

Progressive Disclosure

The content is a monolithic document with all 5 steps inline, totaling well over 200 lines. Steps 2-5 could reasonably be split into separate referenced files. The external resource links at the bottom are helpful, but the main body would benefit from being an overview that points to detailed step files rather than containing everything inline.

2 / 3

Total

9

/

12

Passed

Description

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid skill description with excellent trigger term coverage and completeness, clearly specifying both what the skill does and when to use it. Its main weakness is that the 'what' portion could be more specific about the concrete actions performed (e.g., creating workflow YAML files, configuring secrets, setting up test commands). The description also uses second person ('your build process') which is a minor style issue.

Suggestions

Expand the capability description with more concrete actions, e.g., 'Creates GitHub Actions workflow files, configures Langfuse API keys as secrets, sets up test execution steps, and validates pipeline output.'

Replace second person 'your build process' with third person phrasing like 'the build process' to match the expected voice.

DimensionReasoningScore

Specificity

Names the domain (Langfuse CI/CD with GitHub Actions) and mentions 'automated testing' and 'integration', but doesn't list multiple concrete actions beyond 'configure' — lacks specifics like 'create workflow files, set up test runners, configure environment secrets'.

2 / 3

Completeness

Clearly answers both 'what' (configure Langfuse CI/CD integration with GitHub Actions and automated testing) and 'when' (explicit 'Use when' clause plus a 'Trigger with phrases' section listing specific triggers).

3 / 3

Trigger Term Quality

Explicitly lists natural trigger phrases like 'langfuse CI', 'langfuse GitHub Actions', 'langfuse automated tests', 'CI langfuse', 'langfuse pipeline' — these are terms users would naturally say. Also includes broader terms like 'CI pipelines' and 'automated testing'.

3 / 3

Distinctiveness Conflict Risk

Highly specific niche — Langfuse + CI/CD + GitHub Actions is a very narrow domain. The explicit trigger phrases all include 'langfuse' which makes it unlikely to conflict with generic CI/CD or testing skills.

3 / 3

Total

11

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Repository
jeremylongshore/claude-code-plugins-plus-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.