tessl/pypi-skl2onnx

Convert scikit-learn models to ONNX format for cross-platform inference and deployment

skl2onnx

Name: tessl/pypi-skl2onnx
Author: tessl

skl2onnx converts scikit-learn models to the ONNX (Open Neural Network Exchange) format so that they can be deployed and run for inference across platforms. It supports a wide range of scikit-learn classifiers, regressors, transformers, and preprocessing components, and performs shape inference and type conversion automatically so that the converted model is compatible with ONNX runtime environments.

Package Information

Package Name: skl2onnx
Language: Python
Installation: pip install skl2onnx

Core Imports

import skl2onnx

Commonly used for model conversion:

from skl2onnx import convert_sklearn, to_onnx
from skl2onnx.common.data_types import FloatTensorType

Basic Usage

from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import make_classification
from skl2onnx import convert_sklearn, to_onnx
from skl2onnx.common.data_types import FloatTensorType
import numpy as np

# Create and train a model
X, y = make_classification(n_samples=100, n_features=4, random_state=42)
model = RandomForestClassifier(n_estimators=10, random_state=42)
model.fit(X, y)

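# Convert to ONNX, inferring the input type from a sample of the data
# (cast to float32 so the input matches the FloatTensorType used below)
onnx_model = to_onnx(model, X[:1].astype(np.float32))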

# Or specify types explicitly
initial_type = [('float_input', FloatTensorType([None, 4]))]
onnx_model = convert_sklearn(model, initial_types=initial_type)

# Save the ONNX model
with open("model.onnx", "wb") as f:
    f.write(onnx_model.SerializeToString())

Architecture

The skl2onnx conversion system is built around several key components:

Conversion Engine: Core functions that orchestrate the transformation from sklearn to ONNX
Type System: Comprehensive data type definitions that map Python/NumPy types to ONNX types
Shape Calculators: Functions that infer output shapes for each operator during conversion
Operator Converters: Implementation modules that convert specific sklearn models to ONNX operators
Registration System: Extensible framework for adding custom converters and parsers
Parser System: Components that analyze sklearn models to extract conversion-relevant information

This modular design enables support for 100+ sklearn model types while maintaining extensibility for custom operators and third-party model types.

Capabilities

Core Conversion Functions

Primary functions for converting scikit-learn models to ONNX format: the main conversion function, a simplified variant that infers input types from sample data, and a wrapper that combines an existing sklearn model with the ONNX operator mixin.

def convert_sklearn(model, name=None, initial_types=None, doc_string="", 
                   target_opset=None, custom_conversion_functions=None,
                   custom_shape_calculators=None, custom_parsers=None,
                   options=None, intermediate=False, white_op=None, 
                   black_op=None, final_types=None, dtype=None,
                   naming=None, model_optim=True, verbose=0): ...

def to_onnx(model, X=None, name=None, initial_types=None, target_opset=None,
           options=None, white_op=None, black_op=None, final_types=None,
           dtype=None, naming=None, model_optim=True, verbose=0): ...

def wrap_as_onnx_mixin(model, target_opset=None): ...


Data Types and Type System

Comprehensive type system for ONNX conversion including tensor types, scalar types, sequence types, and automatic type inference utilities. Supports all major numpy and ONNX data types with shape validation.

class FloatTensorType: ...
class Int64TensorType: ...
class StringTensorType: ...
class BooleanTensorType: ...

def guess_data_type(data_type): ...
def guess_initial_types(X, initial_types=None): ...


Registration and Extensibility

System for registering custom converters, parsers, and operators. Includes discovery of supported models and extensibility framework for third-party model types.

def update_registered_converter(model, alias, shape_fct, convert_fct,
                               overwrite=True, parser=None,
                               options=None): ...

def update_registered_parser(model, parser_fct): ...

def supported_converters(from_sklearn=False): ...

def get_model_alias(model_type): ...


Helper Utilities

Investigation and integration utilities for debugging conversions, comparing outputs between sklearn and ONNX models, and integrating custom ONNX graphs.

def collect_intermediate_steps(model, X=None, target_opset=None): ...
def compare_objects(sklearn_output, onnx_output, decimal=5): ...
def enumerate_pipeline_models(model): ...
def add_onnx_graph(onx, to_add, inputs, outputs): ...


Algebra and ONNX Operators

ONNX operator creation system and mixin classes for enhancing scikit-learn models with ONNX capabilities. Enables direct ONNX operator composition and sklearn integration.

class OnnxOperator: ...
class OnnxOperatorMixin: ...


Supported Models

The library provides extensive support for scikit-learn models:

60+ Classifiers - Linear, tree-based, ensemble, neural network, naive Bayes, SVM, neighbors, and meta-classifiers
40+ Regressors - Linear, tree-based, ensemble, neural network, SVM, Gaussian processes, and specialized regressors
30+ Transformers - Scaling, encoding, feature engineering, text processing, imputation, decomposition, feature selection, discretization, and pipelines
Clustering & Outlier Detection - KMeans, isolation forest, local outlier factor, Gaussian mixtures

The complete list of supported models is available in the registration documentation.

Version Information

__version__ = "1.19.1"
__max_supported_opset__ = 21

def get_latest_tested_opset_version(): ...

Error Handling

The library raises specific exceptions for common conversion issues:

MissingConverter - Raised when no converter is registered for a model type
MissingShapeCalculator - Raised when no shape calculator is registered for an operator
Other runtime errors are raised for unsupported operations or conflicting configuration

Install with Tessl CLI

npx tessl i tessl/pypi-skl2onnx

Workspace: tessl
Visibility: Public
Created: 6 months ago
Last updated: about 1 month ago
Describes: pkg:pypi/skl2onnx@1.19.x