Name: tessl/pypi-scikit-learn
Rating: 87 (1 review)
Author: tessl

tessl/pypi-scikit-learn

A comprehensive machine learning library providing supervised and unsupervised learning algorithms with consistent APIs and extensive tools for data preprocessing, model evaluation, and deployment.

Quality: Pending (Does it follow best practices?)

Impact: 87%, 0.98x (average score across 10 eval scenarios)

Security: Pending (the risk profile of this skill)

{
  "context": "Evaluates whether the solution builds the requested unsupervised workflow using scikit-learn's preprocessing, decomposition, mixture, and manifold tools. Checks focus on correct use of StandardScaler, PCA-based variance retention, GaussianMixture model selection via BIC, and deterministic 2D manifold embedding driven by random_state.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Scaling + PCA",
      "description": "Fits sklearn.preprocessing.StandardScaler then sklearn.decomposition.PCA with n_components reaching >=0.90 explained variance (e.g., n_components=0.9) on training data and reuses them for predictions.",
      "max_score": 25
    },
    {
      "name": "Mixture selection",
      "description": "Trains sklearn.mixture.GaussianMixture over the provided cluster_counts, compares Bayesian/Akaike information criterion (e.g., bic) to pick the best count, stores the selected count, and seeds the model with random_state.",
      "max_score": 30
    },
    {
      "name": "Soft predictions",
      "description": "predict() pipes data through the fitted scaler and PCA before calling GaussianMixture.predict and predict_proba, returns labels plus max responsibility per sample, and rejects calls before fit.",
      "max_score": 15
    },
    {
      "name": "Manifold embedding",
      "description": "embedding_2d() runs a manifold method from sklearn.manifold (e.g., Isomap or LocallyLinearEmbedding) on the PCA-transformed training data with n_components=2, passes random_state when supported, caches/returns deterministic output, and errors if unfitted.",
      "max_score": 20
    },
    {
      "name": "Validation errors",
      "description": "Raises ValueError during fit when non-finite entries are present or when min(cluster_counts) exceeds available samples before attempting to train any estimator.",
      "max_score": 10
    }
  ]
}
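
The checklist above describes one workflow end to end, so a compact sketch may help show how the five checks fit together. The following is a minimal illustration under stated assumptions, not the reference solution: the class name `UnsupervisedWorkflow`, its constructor arguments, and the attribute names are hypothetical, since the criteria name only `predict()` and `embedding_2d()`. The choice of `NotFittedError` for calls made before fit, of BIC over AIC, and of `LocallyLinearEmbedding` (which accepts `random_state`) over `Isomap` are also assumptions.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.exceptions import NotFittedError
from sklearn.manifold import LocallyLinearEmbedding
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler


class UnsupervisedWorkflow:
    """Hypothetical wrapper; names are illustrative, not taken from the criteria."""

    def __init__(self, cluster_counts, random_state=0):
        self.cluster_counts = list(cluster_counts)
        self.random_state = random_state
        self._fitted = False
        self._embedding = None

    def _project(self, X):
        # Reuse the fitted scaler and PCA for any data passed in later.
        return self.pca_.transform(self.scaler_.transform(X))

    def fit(self, X):
        X = np.asarray(X, dtype=float)
        # Validate before training any estimator.
        if not np.isfinite(X).all():
            raise ValueError("X contains non-finite entries")
        if min(self.cluster_counts) > X.shape[0]:
            raise ValueError("smallest cluster count exceeds available samples")

        self.scaler_ = StandardScaler().fit(X)
        # A float n_components keeps enough components to reach that
        # fraction of explained variance (here at least 0.90).
        self.pca_ = PCA(n_components=0.9, svd_solver="full")
        self.pca_.fit(self.scaler_.transform(X))
        self._train_Z = self._project(X)

        # Fit one mixture per candidate count and keep the lowest BIC.
        best_bic = np.inf
        for k in self.cluster_counts:
            gmm = GaussianMixture(n_components=k, random_state=self.random_state)
            gmm.fit(self._train_Z)
            bic = gmm.bic(self._train_Z)
            if bic < best_bic:
                best_bic, self.best_k_, self.gmm_ = bic, k, gmm

        self._embedding = None  # invalidate any cached embedding
        self._fitted = True
        return self

    def predict(self, X):
        if not self._fitted:
            raise NotFittedError("call fit() before predict()")
        Z = self._project(np.asarray(X, dtype=float))
        labels = self.gmm_.predict(Z)
        # Max responsibility per sample serves as a soft confidence score.
        confidence = self.gmm_.predict_proba(Z).max(axis=1)
        return labels, confidence

    def embedding_2d(self):
        if not self._fitted:
            raise NotFittedError("call fit() before embedding_2d()")
        if self._embedding is None:
            lle = LocallyLinearEmbedding(
                n_components=2, random_state=self.random_state
            )
            self._embedding = lle.fit_transform(self._train_Z)
        return self._embedding
```

Note that this sketch assumes PCA retains at least two components; `LocallyLinearEmbedding` raises an error if the requested output dimension exceeds the input dimension, so a fuller solution would need to handle that case.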

criteria.json (evals/scenario-8/)
