tessl install tessl/pypi-kedro@1.1.0Kedro helps you build production-ready data and analytics pipelines
Agent Success
Agent success rate when using this tile
98%
Improvement
Agent success rate improvement when using this tile compared to baseline
1.32x
Baseline
Agent success rate without this tile
74%
A utility for extracting specific subsets of a data pipeline based on starting points, ending points, and data dependencies.
Given a data pipeline with multiple nodes, extract all nodes that are downstream (dependent on) a specified set of starting nodes, including the starting nodes themselves.
Given a data pipeline with multiple nodes, extract all nodes that are upstream (prerequisites for) a specified set of target nodes, including the target nodes themselves.
Given a data pipeline with nodes consuming various datasets, extract all nodes that depend on specific input datasets (directly or transitively).
Given a data pipeline with nodes producing various datasets, extract all nodes required to produce specific output datasets.
@generates
def extract_downstream_from_nodes(pipeline, node_names):
"""
Extract a pipeline subset containing specified nodes and all their downstream dependencies.
Args:
pipeline: A Kedro Pipeline object
node_names: A list of node names to use as starting points
Returns:
A new Pipeline object containing only the selected nodes
"""
pass
def extract_upstream_to_nodes(pipeline, node_names):
"""
Extract a pipeline subset containing specified nodes and all their upstream dependencies.
Args:
pipeline: A Kedro Pipeline object
node_names: A list of node names to use as target points
Returns:
A new Pipeline object containing only the selected nodes
"""
pass
def extract_from_inputs(pipeline, input_names):
"""
Extract a pipeline subset containing all nodes that depend on specified input datasets.
Args:
pipeline: A Kedro Pipeline object
input_names: A list of dataset names to use as starting inputs
Returns:
A new Pipeline object containing only the selected nodes
"""
pass
def extract_to_outputs(pipeline, output_names):
"""
Extract a pipeline subset containing all nodes required to produce specified outputs.
Args:
pipeline: A Kedro Pipeline object
output_names: A list of dataset names to use as target outputs
Returns:
A new Pipeline object containing only the selected nodes
"""
passProvides pipeline construction and manipulation capabilities.
@satisfied-by