Serverless Aura Graph Analytics (AGA) GDS Sessions — covers GdsSessions, AuraGraphDataScience, AuraAPICredentials, DbmsConnectionInfo, SessionMemory, get_or_create, remote graph projection with gds.v2.graph.project and gds.graph.project.remote, gds.v2 session endpoints, gds.v2.graph.construct, AuraDB Cypher API memory/sessionId projection, algorithms, write-back, and session lifecycle. Use for AuraDB-connected, self-managed Neo4j, or standalone DataFrame/Spark session workloads. Does NOT cover the embedded GDS plugin on Aura Pro or self-managed Neo4j — use neo4j-gds-skill. Does NOT handle Cypher authoring — use neo4j-cypher-skill. Does NOT cover Snowflake Graph Analytics — use neo4j-snowflake-graph-analytics-skill.
69
85%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
GdsSessions or using AuraGraphDataSciencegds.graph.project.remote(...){ memory: ... } or { sessionId: ... }neo4j-gds-skillneo4j-gds-skillneo4j-cypher-skillneo4j-snowflake-graph-analytics-skill| Deployment | Use |
|---|---|
| Aura Free | ❌ AGA not available |
| Aura Pro | neo4j-gds-skill (embedded plugin) |
| AuraDB + Python client sessions | this skill |
| AuraDB + Cypher API | this skill for AGA-specific projection/session notes; neo4j-cypher-skill for query authoring |
| Self-managed Neo4j + AGA session | this skill |
| Self-managed Neo4j + embedded plugin | neo4j-gds-skill |
| Non-Neo4j data (Pandas, Spark) | this skill (standalone mode) |
graphdatascience >= 1.15 required; >= 1.18 for Sparkgds.v2.graph.project(...), gds.v2.page_rank.*, gds.v2.graph.node_properties.*gds.v2.verify_session_connectivity() after session creationgds.v2.verify_db_connectivity() when source DB access requiredgds.delete() or sessions.delete(name) stops billingAuraAPICredentials.from_env() — never hardcode credentialspip install "graphdatascience>=1.15"import os
from graphdatascience.session import AuraAPICredentials, GdsSessions
sessions = GdsSessions(api_credentials=AuraAPICredentials.from_env())
# Reads: AURA_CLIENT_ID, AURA_CLIENT_SECRET, AURA_PROJECT_ID (optional)
# Create API credentials in Aura Console → Account → API credentialsIf member of multiple projects: set AURA_PROJECT_ID or pass project_id=.
from graphdatascience.session import AlgorithmCategory, SessionMemory
memory = sessions.estimate(
node_count=1_000_000,
relationship_count=5_000_000,
algorithm_categories=[
AlgorithmCategory.CENTRALITY,
AlgorithmCategory.NODE_EMBEDDING,
AlgorithmCategory.COMMUNITY_DETECTION,
],
)
# Returns SessionMemory tier, e.g. SessionMemory.m_8GB
# Fixed tiers: m_2GB … m_256GB — see references/limitations.mdMode A — AuraDB connected:
from graphdatascience.session import DbmsConnectionInfo, SessionMemory, CloudLocation
from datetime import timedelta
db_connection = DbmsConnectionInfo(
username=os.environ["NEO4J_USERNAME"],
password=os.environ["NEO4J_PASSWORD"],
aura_instance_id=os.environ["AURA_INSTANCEID"], # from Aura Console URL
)
gds = sessions.get_or_create(
session_name="my-analysis",
memory=memory,
db_connection=db_connection,
ttl=timedelta(hours=2),
)
gds.v2.verify_session_connectivity()
gds.v2.verify_db_connectivity()Mode B — Self-managed Neo4j:
db_connection = DbmsConnectionInfo(
uri=os.environ["NEO4J_URI"], # e.g. "bolt://my-server:7687"
username=os.environ["NEO4J_USERNAME"],
password=os.environ["NEO4J_PASSWORD"],
)
gds = sessions.get_or_create(
session_name="my-analysis-sm",
memory=SessionMemory.m_8GB,
db_connection=db_connection,
ttl=timedelta(hours=2),
cloud_location=CloudLocation("gcp", "europe-west1"),
)
gds.v2.verify_session_connectivity()
gds.v2.verify_db_connectivity()Mode C — Standalone (no Neo4j DB):
gds = sessions.get_or_create(
session_name="my-standalone",
memory=SessionMemory.m_4GB,
ttl=timedelta(hours=1),
cloud_location=CloudLocation("gcp", "europe-west1"),
)
gds.v2.verify_session_connectivity()get_or_create() is idempotent; reconnects to existing session by name.
From connected Neo4j (remote projection):
query = """
CALL () {
MATCH (p:Person)
OPTIONAL MATCH (p)-[r:KNOWS]->(p2:Person)
RETURN p AS source, r AS rel, p2 AS target,
p {.age, .score} AS sourceNodeProperties,
p2 {.age, .score} AS targetNodeProperties
}
RETURN gds.graph.project.remote(source, target, {
sourceNodeLabels: labels(source),
targetNodeLabels: labels(target),
sourceNodeProperties: sourceNodeProperties,
targetNodeProperties: targetNodeProperties,
relationshipType: type(rel)
})
"""
G, result = gds.v2.graph.project(
graph_name="my-graph",
query=query,
undirected_relationship_types=["KNOWS"],
)
print(f"Projected {G.node_count()} nodes, {G.relationship_count()} relationships")CALL () { ... } required for multi-pattern MATCH. Use UNION inside CALL for multiple labels/rel types.
Remote query uses gds.graph.project.remote(...); pass graph name to gds.v2.graph.project(...), not query.
V1 fallback: gds.graph.project(graph_name="my-graph", query=query, undirected_relationship_types=["KNOWS"]).
AuraDB Cypher API projection:
CYPHER runtime=parallel
MATCH (source)
OPTIONAL MATCH (source)-->(target)
RETURN gds.graph.project(
'my-graph',
source,
target,
{},
{ memory: '2GB' }
)Existing explicit session:
CYPHER runtime=parallel
MATCH (source)
OPTIONAL MATCH (source)-->(target)
RETURN gds.graph.project(
'my-graph',
source,
target,
{},
{ sessionId: '00000000-11111111' }
)Cypher API uses gds.graph.project(...), not gds.graph.project.remote(...). Put memory, ttl, sessionId, batchSize in fifth config argument.
Session management via Cypher API:
CALL gds.session.getOrCreate('test-session', '2GB', duration({minutes: 30}))
YIELD id, name, status
RETURN id, name, status
CALL gds.session.list()
YIELD id, name, status, memory
RETURN id, name, status, memoryImplicit Cypher API sessions delete when all projected graphs in session are dropped.
From Pandas DataFrames (standalone mode):
import pandas as pd
nodes_df = pd.DataFrame([
{"nodeId": 0, "labels": "Person", "age": 30},
{"nodeId": 1, "labels": "Person", "age": 25},
])
rels_df = pd.DataFrame([
{"sourceNodeId": 0, "targetNodeId": 1, "relationshipType": "KNOWS"},
])
G = gds.v2.graph.construct("my-graph", nodes_df, rels_df)
# Multiple DataFrames: gds.v2.graph.construct("g", [nodes1, nodes2], [rels1, rels2])Required columns — nodes: nodeId (int), labels (str). Relationships: sourceNodeId, targetNodeId, relationshipType. Drop string node properties before construct().
# Mutate — chain results without writing to DB
gds.v2.page_rank.mutate(G, mutate_property="pagerank", damping_factor=0.85)
gds.v2.fast_rp.mutate(G,
mutate_property="embedding",
embedding_dimension=128,
feature_properties=["pagerank"],
random_seed=42,
)
# Stream — inspect results as DataFrame
df = gds.v2.page_rank.stream(G)
print(df.sort_values("score", ascending=False).head(10))
# Write — persist to connected Neo4j DB (connected modes only)
gds.v2.louvain.write(G, write_property="community")V1 fallback: gds.pageRank.mutate(..., mutateProperty="pagerank"). Plugin algorithm reference → neo4j-gds-skill; AGA limitations differ.
Long-running algorithms may return job handle. Poll until done:
import time
job = gds.v2.page_rank.mutate(G, mutate_property="pagerank")
# If job object returned (async mode), poll explicitly:
if hasattr(job, "status"):
while job.status() not in ("RUNNING_DONE", "FAILED", "CANCELLED"):
time.sleep(5)
print(f"Job status: {job.status()}")
if job.status() != "RUNNING_DONE":
raise RuntimeError(f"Algorithm job failed: {job.status()}")Large graphs: check .status() before reading results.
Non-blocking API [graphdatascience 1.22]: _async projection variants return immediately; compute methods return a JobHandle, write-back returns a WriteJobHandle. List/retrieve running jobs:
gds.v2.jobs.list() # all jobs in session
job = gds.v2.jobs.get(job_id)# Stream node properties
result_df = gds.v2.graph.node_properties.stream(
G,
node_properties=["pagerank", "embedding"],
db_node_properties=["name"], # connected modes only
)
result_df.head(10)Standalone mode: no db_node_properties; join source DataFrame:
result_df = gds.v2.graph.node_properties.stream(G, ["pagerank"])
result_df.merge(nodes_df[["nodeId", "name"]], how="left")# Write node properties to connected Neo4j
gds.v2.graph.node_properties.write(G, ["pagerank", "embedding"])
# Write relationship properties
gds.v2.graph.relationships.write(G, "SIMILAR", ["score"])
# Query connected DB from session
gds.run_cypher("MATCH (n:Person) RETURN count(n)")
# Drop projected graph
gds.v2.graph.drop(G)
# Delete session
sessions.delete(session_name="my-analysis")
# or: gds.delete()Write before delete; unwritten results lost when session closes.
# List active sessions
from pandas import DataFrame
DataFrame(sessions.list())
# Reconnect to existing session
gds = sessions.get_or_create(session_name="my-analysis", memory=..., db_connection=...)| Error | Cause | Fix |
|---|---|---|
AuthenticationError / 401 | Wrong CLIENT_ID/CLIENT_SECRET | Regenerate in Aura Console → Account → API credentials |
SessionNotFoundError | Session expired (TTL exceeded) or name typo | sessions.list() to check; recreate session |
GraphNotFoundError | Projection dropped or session reconnected without re-projecting | Re-run gds.v2.graph.project() or gds.v2.graph.construct() |
Algorithm job FAILED | Memory limit exceeded or unsupported algorithm | Increase SessionMemory; check topological link prediction not used |
MemoryEstimationExceeded | Graph larger than estimated | Re-estimate with actual counts; pick next tier up |
| Results empty after session reconnect | Results not written before session was closed | Always write/stream before gds.delete() |
String node properties not supported | String column in nodes DataFrame | Drop string columns before gds.v2.graph.construct() |
AGA not enabled for project | AGA feature not activated | Enable in Aura Console → project settings |
Load on demand:
| Need | URL |
|---|---|
| AGA Python client docs | https://neo4j.com/docs/graph-data-science-client/current/aura-graph-analytics/ |
| AGA Cypher API docs | https://neo4j.com/docs/graph-data-science/current/aura-graph-analytics/cypher/ |
| Python client v2 docs | https://neo4j.com/docs/graph-data-science-client/current/v2_endpoints/ |
| AuraDB tutorial notebook | https://github.com/neo4j/graph-data-science-client/blob/main/examples/graph-analytics-serverless.ipynb |
| GDS algorithm reference | https://neo4j.com/docs/graph-data-science/current/algorithms/ |
AURA_CLIENT_ID, AURA_CLIENT_SECRET)sessions.estimate(...))gds.v2.verify_session_connectivity() called after session creationgds.v2.verify_db_connectivity() when source DB access requiredgds.v2.graph.project(..., query) with gds.graph.project.remote(...) inside querymemory or sessionIdgds.session.getOrCreate(...); implicit sessions dropped with projected graphRUNNING_DONE before reading resultssessions.delete(...) or gds.delete())6d44d31
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.