CtrlK
BlogDocsLog inGet started
Tessl Logo

tessl/pypi-pycuda

Python wrapper for Nvidia CUDA parallel computation API with object cleanup, automatic error checking, and convenient abstractions.

62

0.93x
Overview
Eval results
Files

Evaluation results

97%

GPU Kernel Performance Analyzer

Kernel function retrieval

Criteria
Without context
With context

Module loading

100%

100%

Function retrieval

100%

100%

Register count query

100%

100%

Shared memory query

100%

100%

Max threads query

100%

100%

CUDA context initialization

70%

70%

Error handling

100%

100%

97%

7%

GPU Context Manager with Synchronization

Context synchronization

Criteria
Without context
With context

Device initialization

33%

80%

Context creation

100%

100%

Context cleanup

100%

100%

Context synchronization

100%

100%

Device properties

100%

100%

45%

-10%

Multi-Module CUDA Kernel Linker

Module linking

Criteria
Without context
With context

PTX compilation

0%

100%

Linker creation

100%

0%

PTX addition

100%

0%

Module linking

0%

0%

Function retrieval

100%

100%

100%

Image Brightness Adjustment

Kernel launch with grid/block dimensions

Criteria
Without context
With context

Kernel Compilation

100%

100%

Grid/Block Configuration

100%

100%

Kernel Invocation

100%

100%

Memory Transfer

100%

100%

Context Initialization

100%

100%

90%

16%

GPU Memory Pool Manager

Advanced memory features (unified addressing, memory pools)

Criteria
Without context
With context

DeviceMemoryPool usage

60%

95%

PageLockedMemoryPool usage

73%

93%

Managed memory allocation

80%

92%

Pool-based allocation

93%

100%

Memory pool freeing

60%

80%

Pool statistics

60%

60%

Context initialization

100%

100%

72%

-28%

Vector Field Calculator

GPUArray arithmetic operations

Criteria
Without context
With context

GPUArray creation

100%

80%

Element-wise arithmetic

100%

60%

Magnitude calculation

100%

100%

Vector normalization

100%

25%

GPU-CPU transfer

100%

100%

Context initialization

100%

100%

0%

-25%

GPU Vector Operations with Automatic Memory Marshaling

Argument handlers (In, Out, InOut)

Criteria
Without context
With context

Kernel compilation

100%

0%

Prepared call usage

0%

0%

In handler usage

0%

0%

Out handler usage

0%

0%

Grid/block configuration

100%

0%

100%

GPU Data Transfer Optimizer

Pagelocked host memory

Criteria
Without context
With context

Pagelocked Buffer Creation

100%

100%

Device Memory Allocation

100%

100%

Host-to-Device Transfer

100%

100%

Device-to-Host Transfer

100%

100%

Context Management

100%

100%

Timing Implementation

100%

100%

0%

Image Sampler with Edge Handling

Texture addressing and filtering modes

Criteria
Without context
With context

CUDA Array Creation

0%

0%

Texture Reference Setup

0%

0%

Addressing Mode Configuration

0%

0%

Filtering Mode Configuration

0%

0%

Texture Sampling in Kernel

0%

0%

20%

5%

Sparse Linear System Solver

Conjugate Gradient solver

Criteria
Without context
With context

Sparse Matrix Format

0%

0%

CG Solver Invocation

0%

0%

Preconditioner Integration

0%

0%

Convergence Configuration

53%

80%

Result Extraction

70%

80%

Install with Tessl CLI

npx tessl i tessl/pypi-pycuda
Evaluated
Agent
Codex

Table of Contents