tessl/pypi-flagembedding

FlagEmbedding - BGE: One-Stop Retrieval Toolkit For Search and RAG

Describes
pkg:pypi/flagembedding@1.3.x

To install, run

npx @tessl/cli install tessl/pypi-flagembedding@1.3.0


FlagEmbedding

FlagEmbedding (BGE - BAAI General Embedding) is a comprehensive Python library focused on retrieval-augmented language models and embedding technologies. The package provides state-of-the-art text embedding models, rerankers, and multimodal embedding capabilities for search and RAG applications. It includes tools for both inference and fine-tuning of embedding models, bundles evaluation frameworks, and supports a range of retrieval tasks, including text-to-text, text-to-image, and image-to-text.

Package Information

  • Package Name: FlagEmbedding
  • Language: Python
  • Installation: pip install FlagEmbedding

Core Imports

from FlagEmbedding import FlagAutoModel, FlagAutoReranker

For direct model access:

from FlagEmbedding import FlagModel, BGEM3FlagModel, FlagReranker
from FlagEmbedding import FlagLLMModel, FlagICLModel, FlagLLMReranker

For model class enumeration:

from FlagEmbedding import EmbedderModelClass, RerankerModelClass

Basic Usage

from FlagEmbedding import FlagAutoModel, FlagAutoReranker

# Initialize embedder with automatic model selection
embedder = FlagAutoModel.from_finetuned('bge-large-en-v1.5', use_fp16=True)

# Encode queries and documents
queries = ["What is machine learning?", "How do neural networks work?"]
documents = [
    "Machine learning is a subset of artificial intelligence...",
    "Neural networks are computing systems inspired by biological neural networks..."
]

query_embeddings = embedder.encode_queries(queries)
doc_embeddings = embedder.encode_corpus(documents)

# Initialize reranker for scoring
reranker = FlagAutoReranker.from_finetuned('bge-reranker-base')

# Score query-document pairs
pairs = [("What is machine learning?", "Machine learning is a subset of artificial intelligence...")]
scores = reranker.compute_score(pairs)

print(f"Similarity score: {scores[0]}")

Architecture

FlagEmbedding is built around a layered design that supports multiple model types and transformer architectures:

  • Abstract Base Classes: AbsEmbedder and AbsReranker define the interface contracts for all embedding/reranking models
  • Auto Models: Factory classes (FlagAutoModel, FlagAutoReranker) that automatically select appropriate model implementations
  • Concrete Implementations: Specialized classes for encoder-only models (BERT-like), decoder-only models (LLM-like), and hybrid approaches
  • Multi-device Support: Built-in parallelization across multiple GPUs/devices for scalable inference (see the sketch after this list)
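
To illustrate the multi-device point, a hedged sketch of spreading encoding across two GPUs via the devices parameter of FlagAutoModel.from_finetuned (the device names assume two CUDA devices are available):

from FlagEmbedding import FlagAutoModel

# FlagEmbedding splits encoding work across every device in the list;
# pass a single entry (e.g. ["cpu"]) to stay on one device.
embedder = FlagAutoModel.from_finetuned(
    'bge-large-en-v1.5',
    use_fp16=True,
    devices=["cuda:0", "cuda:1"],  # assumption: two CUDA devices present
)

embeddings = embedder.encode_corpus(["passage one", "passage two"])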

Capabilities

Auto Model Factory

Automatically selects and initializes the appropriate embedder or reranker class based on the model name, providing the simplest way to use FlagEmbedding with any supported model.

class FlagAutoModel:
    @classmethod
    def from_finetuned(
        cls, 
        model_name_or_path: str, 
        model_class: Optional[str] = None,
        normalize_embeddings: bool = True,
        use_fp16: bool = True,
        query_instruction_for_retrieval: Optional[str] = None,
        devices: Optional[List[str]] = None,
        pooling_method: Optional[str] = None,
        trust_remote_code: Optional[bool] = None,
        **kwargs
    ) -> AbsEmbedder: ...

class FlagAutoReranker:
    @classmethod
    def from_finetuned(
        cls,
        model_name_or_path: str,
        model_class: Optional[str] = None,
        use_fp16: bool = False,
        trust_remote_code: Optional[bool] = None,
        **kwargs
    ) -> AbsReranker: ...
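
When a checkpoint's name is not in the factory's model mapping, the model_class argument pins the implementation explicitly; the values correspond to the EmbedderModelClass and RerankerModelClass enumerations described below. A sketch with a hypothetical fine-tuned checkpoint:

from FlagEmbedding import FlagAutoModel

# Force the encoder-only implementation instead of name-based detection.
embedder = FlagAutoModel.from_finetuned(
    'my-org/my-finetuned-bge',        # hypothetical checkpoint name
    model_class='encoder-only-base',  # value from EmbedderModelClass
    pooling_method='cls',
    normalize_embeddings=True,
)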

See docs/auto-models.md.

Encoder-Only Embedders

Embedders designed for encoder-only transformer models (BERT-like architectures), including specialized implementations for BGE-M3 models with dense, sparse, and ColBERT support.

class FlagModel(AbsEmbedder):
    def __init__(
        self,
        model_name_or_path: str,
        pooling_method: str = "cls",
        normalize_embeddings: bool = True,
        use_fp16: bool = True,
        trust_remote_code: bool = False,
        **kwargs
    ): ...

class BGEM3FlagModel(AbsEmbedder):
    def __init__(
        self,
        model_name_or_path: str,
        pooling_method: str = "cls",
        normalize_embeddings: bool = True,
        use_fp16: bool = True,
        colbert_dim: int = -1,
        return_dense: bool = True,
        return_sparse: bool = False,
        return_colbert_vecs: bool = False,
        **kwargs
    ): ...
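
A sketch of requesting all three BGE-M3 output types in one call; the output keys ('dense_vecs', 'lexical_weights', 'colbert_vecs') follow the upstream BGE-M3 usage:

from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel('BAAI/bge-m3', use_fp16=True)

sentences = ["What is BGE M3?", "BM25 is a bag-of-words retrieval function."]

# Ask for dense, sparse (lexical), and ColBERT multi-vector outputs at once.
output = model.encode(
    sentences,
    return_dense=True,
    return_sparse=True,
    return_colbert_vecs=True,
)

dense = output['dense_vecs']        # one dense vector per sentence
sparse = output['lexical_weights']  # per-sentence token-weight mappings
colbert = output['colbert_vecs']    # one matrix of token vectors per sentence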

See docs/encoder-embedders.md.

Decoder-Only Embedders

Embedders for decoder-only transformer models (LLM-like architectures), including support for in-context learning approaches.

class FlagLLMModel(AbsEmbedder):
    def __init__(
        self,
        model_name_or_path: str,
        pooling_method: str = "last_token",
        normalize_embeddings: bool = True,
        use_fp16: bool = True,
        query_instruction_format: str = "Instruct: {}\nQuery: {}",
        **kwargs
    ): ...

class FlagICLModel(AbsEmbedder):
    def __init__(
        self,
        model_name_or_path: str,
        pooling_method: str = "last_token",
        normalize_embeddings: bool = True,
        use_fp16: bool = True,
        **kwargs
    ): ...
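
A sketch following the upstream usage for decoder-only embedders; the instruction string is illustrative, and FlagICLModel additionally accepts few-shot demonstrations (examples_for_task in the upstream bge-en-icl examples):

from FlagEmbedding import FlagLLMModel

# Decoder-only embedders pool the last token and typically expect a task
# instruction prepended to each query (see query_instruction_format above).
model = FlagLLMModel(
    'BAAI/bge-multilingual-gemma2',
    query_instruction_for_retrieval="Given a web search query, retrieve relevant passages that answer the query.",
    use_fp16=True,
)

query_embeddings = model.encode_queries(["how do transformers use attention?"])
doc_embeddings = model.encode_corpus(["Attention layers weight token interactions..."])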

See docs/decoder-embedders.md.

Reranking Models

Reranking models for scoring query-document pairs, available in both encoder-only and decoder-only variants with specialized implementations for different use cases.

class FlagReranker(AbsReranker):
    def __init__(
        self,
        model_name_or_path: str,
        use_fp16: bool = False,
        trust_remote_code: bool = False,
        **kwargs
    ): ...

class FlagLLMReranker(AbsReranker):
    def __init__(
        self,
        model_name_or_path: str,
        use_fp16: bool = False,
        **kwargs
    ): ...
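
Raw reranker scores are unbounded logits; per the upstream documentation they can be mapped into [0, 1] with a sigmoid by passing normalize=True to compute_score. A minimal sketch:

from FlagEmbedding import FlagReranker

reranker = FlagReranker('BAAI/bge-reranker-v2-m3', use_fp16=True)

pairs = [
    ("what is a panda?", "The giant panda is a bear species endemic to China."),
    ("what is a panda?", "Paris is the capital of France."),
]

scores = reranker.compute_score(pairs)                 # raw logits
probs = reranker.compute_score(pairs, normalize=True)  # sigmoid, in [0, 1]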

See docs/rerankers.md.

Base Classes and Interfaces

Abstract base classes that define the core interface contracts for embedders and rerankers, providing multi-device support and consistent API patterns.

class AbsEmbedder:
    def encode_queries(
        self, 
        queries: List[str], 
        batch_size: Optional[int] = None,
        max_length: Optional[int] = None,
        convert_to_numpy: Optional[bool] = None,
        **kwargs
    ) -> Union[torch.Tensor, np.ndarray]: ...

class AbsReranker:
    def compute_score(
        self, 
        sentence_pairs: List[Tuple[str, str]], 
        **kwargs
    ) -> np.ndarray: ...
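
Because every concrete model implements these contracts, retrieval pipelines can be written once against the abstract interface. A sketch of a hypothetical retrieve-then-rerank helper (not part of the library), assuming NumPy embedding outputs:

import numpy as np

def retrieve_then_rerank(embedder, reranker, query, corpus, top_k=3):
    """Hypothetical helper: dense retrieval shortlist, then reranking.

    Works with any AbsEmbedder/AbsReranker pair, since all concrete models
    share the encode_queries/encode_corpus and compute_score contracts.
    """
    q = embedder.encode_queries([query])       # shape: (1, dim)
    d = embedder.encode_corpus(corpus)         # shape: (n, dim)
    sims = (q @ d.T).ravel()                   # cosine sim for normalized vectors
    shortlist = sims.argsort()[::-1][:top_k]   # best candidates by dense score
    pairs = [(query, corpus[i]) for i in shortlist]
    scores = reranker.compute_score(pairs)     # rerank only the shortlist
    order = np.argsort(scores)[::-1]
    return [(corpus[shortlist[i]], float(scores[i])) for i in order]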

See docs/base-classes.md.

Model Enumerations and Utilities

Enumerations for supported model classes and utility functions for discovering available models and their capabilities.

class EmbedderModelClass(Enum):
    ENCODER_ONLY_BASE = "encoder-only-base"
    ENCODER_ONLY_M3 = "encoder-only-m3"
    DECODER_ONLY_BASE = "decoder-only-base"
    DECODER_ONLY_ICL = "decoder-only-icl"

class RerankerModelClass(Enum):
    ENCODER_ONLY_BASE = "encoder-only-base"
    DECODER_ONLY_BASE = "decoder-only-base"
    DECODER_ONLY_LAYERWISE = "decoder-only-layerwise"
    DECODER_ONLY_LIGHTWEIGHT = "decoder-only-lightweight"

def support_model_list() -> List[str]: ...
def support_native_bge_model_list() -> List[str]: ...
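
A sketch combining the discovery helpers with the enumerations, assuming support_model_list is exported from the package root as the listing above suggests; whether from_finetuned also accepts the enum member directly is not confirmed here, so the string value is used:

from FlagEmbedding import FlagAutoModel, EmbedderModelClass, support_model_list

# List every model name the auto factory recognizes.
print(support_model_list())

# Pin the implementation class via the enum's string value.
embedder = FlagAutoModel.from_finetuned(
    'bge-large-en-v1.5',
    model_class=EmbedderModelClass.ENCODER_ONLY_BASE.value,  # "encoder-only-base"
)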

See docs/model-types.md.

Supported Models

FlagEmbedding supports a comprehensive range of pre-trained models:

  • BGE Models: bge-m3, bge-large-en-v1.5, bge-base-en-v1.5, bge-small-en-v1.5, bge-large-zh-v1.5, bge-multilingual-gemma2, bge-en-icl
  • E5 Models: e5-mistral-7b-instruct, e5-large-v2, e5-base-v2, multilingual-e5-large-instruct, multilingual-e5-large
  • GTE Models: gte-Qwen2-7B-instruct, gte-Qwen2-1.5B-instruct, gte-large-en-v1.5, gte-base-en-v1.5, gte-multilingual-base
  • Reranker Models: bge-reranker-base, bge-reranker-large, bge-reranker-v2-m3, bge-reranker-v2-gemma, bge-reranker-v2-minicpm-layerwise

Common Types

from typing import List, Union, Optional, Dict, Any, Tuple
import torch
import numpy as np

# Core types used across the API
QueryType = Union[str, List[str]]
CorpusType = Union[str, List[str]]
EmbeddingOutput = Union[torch.Tensor, np.ndarray]
SentencePair = Tuple[str, str]
DeviceSpec = Union[str, List[str]]
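
These aliases can annotate application code, reusing the imports above; a brief sketch with a hypothetical wrapper:

def encode_pair_sets(
    embedder,
    queries: QueryType,
    corpus: CorpusType,
) -> Tuple[EmbeddingOutput, EmbeddingOutput]:
    """Hypothetical wrapper showing the aliases in a signature."""
    if isinstance(queries, str):
        queries = [queries]
    if isinstance(corpus, str):
        corpus = [corpus]
    return embedder.encode_queries(queries), embedder.encode_corpus(corpus)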