langfuse-ci-integration

Configure Langfuse CI/CD integration with GitHub Actions and automated testing. Use when setting up automated testing, configuring CI pipelines, or integrating Langfuse tests into your build process. Trigger with phrases like "langfuse CI", "langfuse GitHub Actions", "langfuse automated tests", "CI langfuse", "langfuse pipeline".

Quality

77%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/saas-packs/langfuse-pack/skills/langfuse-ci-integration/SKILL.md

Quality

Discovery

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid description with excellent trigger terms and completeness, clearly specifying both when to use it and what it does. Its main weakness is that the 'what' portion could be more specific about the concrete actions performed (e.g., creating workflow YAML files, configuring secrets, setting up test commands). The description also uses second person ('your build process') which is a minor style issue.

Suggestions

Add more specific concrete actions like 'Creates GitHub Actions workflow files, configures environment secrets, sets up test execution steps' to improve specificity.

Replace second person 'your build process' with third person phrasing like 'the build process' to match the preferred voice.

Dimension	Reasoning	Score
Specificity	Names the domain (Langfuse CI/CD with GitHub Actions) and mentions 'automated testing' and 'integration', but doesn't list multiple concrete actions beyond 'configure' — lacks specifics like 'create workflow files', 'set up test runners', 'configure environment secrets', etc.	2 / 3
Completeness	Clearly answers both 'what' (configure Langfuse CI/CD integration with GitHub Actions and automated testing) and 'when' (explicit 'Use when' clause plus 'Trigger with phrases like' section providing concrete trigger terms).	3 / 3
Trigger Term Quality	Includes a good set of natural trigger phrases: 'langfuse CI', 'langfuse GitHub Actions', 'langfuse automated tests', 'CI langfuse', 'langfuse pipeline'. Also includes broader terms like 'automated testing', 'CI pipelines', and 'build process' that users would naturally say.	3 / 3
Distinctiveness Conflict Risk	The combination of 'Langfuse' + 'CI/CD' + 'GitHub Actions' creates a very specific niche. The trigger terms are all Langfuse-prefixed, making it unlikely to conflict with generic CI/CD skills or other Langfuse skills focused on different aspects.	3 / 3
	Total	11 / 12 Passed

Implementation

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a highly actionable skill with complete, executable code examples covering the full CI/CD integration lifecycle for Langfuse. Its main weaknesses are length (could benefit from splitting detailed code into bundle files) and missing explicit validation/feedback loops between pipeline stages. The error handling table is a nice touch but the overall document would benefit from tighter organization.

Suggestions

Add explicit validation checkpoints between steps—e.g., verify prompt deployment succeeded before running regression tests, and define what happens when quality gates fail (block deploy, alert, etc.).

Split the detailed TypeScript test files and deployment scripts into separate bundle files, keeping SKILL.md as a concise overview with references to each file.

Dimension	Reasoning	Score
Conciseness	The skill is fairly long with extensive code examples across 5 steps. Some content could be tightened—the connectivity check step is somewhat unnecessary, and the best practices/error handling tables, while useful, add bulk. However, most code examples earn their place as they demonstrate non-obvious integration patterns.	2 / 3
Actionability	Every step provides fully executable, copy-paste ready code—complete GitHub Actions YAML workflows, TypeScript test files, and deployment scripts with real imports, API calls, and assertions. The examples are concrete and specific with realistic test scenarios.	3 / 3
Workflow Clarity	The 5 steps are clearly sequenced and logically ordered (setup → test → gate → deploy → monitor). However, there are no explicit validation checkpoints or feedback loops between steps—e.g., no guidance on what to do if the quality gate fails before deployment, or verification that prompt deployment succeeded before running regression tests.	2 / 3
Progressive Disclosure	The skill has reasonable section structure and links to external Langfuse docs, but all content is inline in a single long file. The detailed test files and deployment scripts could be split into referenced bundle files, with SKILL.md serving as a concise overview pointing to them.	2 / 3
	Total	9 / 12 Passed

Validation

81%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 9 / 11 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning

	Total	9 / 11 Passed

Repository: jeremylongshore/claude-code-plugins-plus-skills
Commit: 23fe3bf

Reviewed: 3 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.