neo4j-document-import-skill

Ingests unstructured and semi-structured documents into Neo4j as a knowledge graph. Use when chunking PDFs, HTML, plain text, or Markdown; extracting entities and relationships from text with an LLM (SimpleKGPipeline, neo4j-graphrag); loading JSON via apoc.load.json; building Document→Chunk→Entity graph structures; or connecting LangChain/LlamaIndex document loaders to Neo4j. Covers neo4j-graphrag SimpleKGPipeline, LLM Graph Builder web UI, entity resolution, chunking strategies, and graph schema design for RAG pipelines. Does NOT handle structured CSV/relational import — use neo4j-import-skill. Does NOT handle GraphRAG retrieval after ingestion — use neo4j-graphrag-skill. Does NOT handle vector index creation — use neo4j-vector-search-skill.

71

Quality: 88% (Does it follow best practices?)

Impact: No eval scenarios have been run

Security (by Snyk): Advisory. Suggest reviewing before use.

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a high-quality, comprehensive skill with excellent actionability and clear workflow sequencing. Its main weakness is length — several advanced/reference sections (GraphSchema deep-dive, Parquet export, LexicalGraphConfig, custom loaders) inflate the body beyond what's needed for the core workflow and should be offloaded to the referenced kg-construction.md file. The duplicated GraphSchema content between Step 1 and the later section also hurts conciseness.

Suggestions

Move the 'GraphSchema — Current API', 'Parquet Export', 'LexicalGraphConfig', and 'Custom Document Loaders' sections into references/kg-construction.md, keeping only brief pointers in the main SKILL.md body.

Eliminate the duplication between Step 1's schema definition and the later 'GraphSchema — Current API' section — keep the concise version in Step 1 and the detailed API reference in the bundle file.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Conciseness | The skill is comprehensive and mostly efficient, but includes some redundancy: the GraphSchema section appears twice (once in Step 1 and again in the 'GraphSchema — Current API' section) with overlapping content. Some sections like LexicalGraphConfig and Custom Document Loaders could be moved to the referenced kg-construction.md file. However, it generally avoids explaining concepts Claude already knows. | 2 / 3 |
| Actionability | Excellent actionability throughout: every section provides fully executable code examples with correct imports, concrete Cypher queries, pip install commands, and specific parameter values. The decision table, chunking guidance table, and error table all provide immediately usable guidance. | 3 / 3 |
| Workflow Clarity | The skill follows a clear numbered step sequence (Steps 1–5) with explicit validation checkpoints: constraints must be ONLINE before ingestion, entity resolution runs after ingestion, and a verification checklist at the end provides a comprehensive feedback loop. The 'SHOW INDEXES' polling pattern and the 'wait then re-run' instruction demonstrate proper error recovery. | 3 / 3 |
| Progressive Disclosure | The skill references `references/kg-construction.md` for extended content and provides external documentation links, which is good. However, the body itself is quite long (~400+ lines) with sections like the full GraphSchema API reference, Parquet export, LexicalGraphConfig, and Custom Document Loaders that could be split into the referenced file. The bundle has no actual files to support the reference, and some content is duplicated between Step 1 and the later GraphSchema section. | 2 / 3 |
| Total | | 10 / 12 |

Passed
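The Workflow Clarity reasoning credits the skill with a 'SHOW INDEXES' polling pattern and a "wait then re-run" instruction. The skill's own code is not reproduced on this page, so the following is only a minimal sketch of what such a polling loop might look like. The function name `wait_for_indexes_online`, its parameters, and the injected `fetch_states` callable are all hypothetical; the only assumption about Neo4j is that `SHOW INDEXES YIELD name, state` reports a `state` such as `ONLINE`, `POPULATING`, or `FAILED`.

```python
import time
from typing import Callable, Iterable, Mapping


def wait_for_indexes_online(
    fetch_states: Callable[[], Iterable[Mapping[str, str]]],
    timeout_s: float = 60.0,
    poll_interval_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until every index reports state ONLINE (hypothetical helper).

    fetch_states is assumed to return rows shaped like the output of
    `SHOW INDEXES YIELD name, state`, e.g. {"name": "...", "state": "ONLINE"}.
    Returns True once all rows are ONLINE, False if the timeout expires.
    Raises RuntimeError if any index reports FAILED.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        rows = list(fetch_states())
        failed = [row["name"] for row in rows if row["state"] == "FAILED"]
        if failed:
            raise RuntimeError(f"indexes failed to build: {failed}")
        # An empty result means there is nothing to wait for.
        if all(row["state"] == "ONLINE" for row in rows):
            return True
        if time.monotonic() >= deadline:
            return False
        # "Wait then re-run": pause before issuing the query again.
        sleep(poll_interval_s)
```

With the official Neo4j Python driver, `fetch_states` could plausibly be something like `lambda: [r.data() for r in session.run("SHOW INDEXES YIELD name, state")]`; injecting it as a callable keeps the loop testable without a running database.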

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that excels across all dimensions. It provides highly specific capabilities, rich trigger terms covering the full domain vocabulary, explicit 'Use when' guidance with concrete scenarios, and outstanding boundary definitions that prevent conflicts with related skills. The 'Does NOT handle' clauses are a best practice that other skill descriptions should emulate.

| Dimension | Reasoning | Score |
| --- | --- | --- |
| Specificity | Lists multiple specific concrete actions: chunking PDFs/HTML/text/Markdown, extracting entities and relationships with LLM, loading JSON via apoc.load.json, building Document→Chunk→Entity graph structures, connecting LangChain/LlamaIndex document loaders to Neo4j. Very detailed and actionable. | 3 / 3 |
| Completeness | Clearly answers 'what' (ingests unstructured/semi-structured documents into Neo4j as a knowledge graph, with detailed sub-capabilities) and 'when' (explicit 'Use when' clause listing specific trigger scenarios). Additionally includes explicit 'Does NOT' boundaries directing to other skills, which strengthens the when-to-use guidance. | 3 / 3 |
| Trigger Term Quality | Excellent coverage of natural terms users would say: PDFs, HTML, Markdown, plain text, knowledge graph, chunking, entity extraction, SimpleKGPipeline, neo4j-graphrag, LLM Graph Builder, LangChain, LlamaIndex, RAG pipelines, apoc.load.json, entity resolution. These are terms users working in this domain would naturally use. | 3 / 3 |
| Distinctiveness Conflict Risk | Exceptionally distinctive: explicitly delineates boundaries with three related skills (neo4j-import-skill for CSV, neo4j-graphrag-skill for retrieval, neo4j-vector-search-skill for vector indexes). The 'Does NOT handle' clauses make it very clear when this skill should NOT be selected, minimizing conflict risk. | 3 / 3 |
| Total | | 12 / 12 |

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation: 9 / 11 Passed

Validation for skill structure

| Criteria | Description | Result |
| --- | --- | --- |
| skill_md_line_count | SKILL.md is long (586 lines); consider splitting into references/ and linking | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 9 / 11 |

Passed

Repository: neo4j-contrib/neo4j-skills
Reviewed
