libtransform - Resource transformation utilities. pdfToHtml converts PDF documents to HTML using LLM vision capabilities, splitting pages to images and processing with multimodal models. Use for document conversion, PDF processing, and knowledge extraction from documents.
76
Does it follow best practices?
If you maintain this skill, you can automatically optimize it using the tessl CLI to improve its score:
npx tessl skill review --optimize ./path/to/skillValidation for skill structure
pdfToHtml: Splits PDF into page images, sends to vision-capable LLM, and assembles HTML output with semantic structure.
import { pdfToHtml } from "@copilot-ld/libtransform";
const pdfBuffer = await fs.readFile("document.pdf");
const html = await pdfToHtml(pdfBuffer, {
model: "gpt-4-vision-preview",
maxPages: 50,
});Used by libingest pipeline for document processing. Requires LLM with vision capabilities.
da36296
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.