tessl/pypi-gdown

Google Drive Public File/Folder Downloader that bypasses security notices and provides recursive folder downloads

—

Pending

Quality

Pending

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

Overview

Eval results

Files

File Download Operations

Name: tessl/pypi-gdown
Author: tessl

Core file downloading functionality with comprehensive Google Drive support, format conversion, and intelligent URL handling.

Capabilities

Primary Download Function

Downloads files from URLs with extensive Google Drive support, including security bypass, authentication, and format conversion.

def download(
    url=None,
    output=None,
    quiet=False,
    proxy=None,
    speed=None,
    use_cookies=True,
    verify=True,
    id=None,
    fuzzy=False,
    resume=False,
    format=None,
    user_agent=None,
    log_messages=None
) -> str:
    """
    Download file from URL with Google Drive support.

    Parameters:
    - url (str): URL to download from. Google Drive URLs supported.
    - output (str): Output filename/directory. Defaults to basename of URL.
                   If ends with '/', treated as parent directory.
    - quiet (bool): Suppress terminal output. Default: False.
    - proxy (str): Proxy configuration in format 'protocol://host:port'.
    - speed (float): Download speed limit in bytes per second (e.g., 256*1024 for 256KB/s).
    - use_cookies (bool): Use cookies from ~/.cache/gdown/cookies.txt. Default: True.
    - verify (bool/str): TLS certificate verification. True/False or path to CA bundle. Default: True.
    - id (str): Google Drive file ID. Cannot be used with url parameter.
    - fuzzy (bool): Extract Google Drive file ID from any URL format. Default: False.
    - resume (bool): Resume interrupted downloads, skip completed files. Default: False.
    - format (str): Export format for Google Docs/Sheets/Slides.
                   Defaults: Docs='docx', Sheets='xlsx', Slides='pptx'.
    - user_agent (str): Custom user agent string.
    - log_messages (dict): Custom messages with keys 'start' and 'output'.

    Returns:
    str: Path to downloaded file.

    Raises:
    FileURLRetrievalError: When unable to retrieve download URL.
    ValueError: When both url and id are specified or neither.
    """

Usage Examples

Basic File Download

import gdown

# Download with URL
url = "https://drive.google.com/uc?id=1l_5RK28JRL19wpT22B-DY9We3TVXnnQQ"
output_path = gdown.download(url, "my_file.npz")
print(f"Downloaded to: {output_path}")

Download with File ID

# Direct file ID usage
file_id = "1l_5RK28JRL19wpT22B-DY9We3TVXnnQQ"
output_path = gdown.download(id=file_id, output="my_file.npz")

Fuzzy URL Matching

# Copy-paste any Google Drive URL format
messy_url = "https://drive.google.com/file/d/1l_5RK28JRL19wpT22B-DY9We3TVXnnQQ/view?usp=sharing&resourcekey=0-WWs_XOSctfaY_0-sJBKRSQ"
gdown.download(messy_url, "output.npz", fuzzy=True)

Google Workspace Document Export

# Download Google Doc as PDF
doc_url = "https://docs.google.com/document/d/DOCUMENT_ID/edit"
gdown.download(doc_url, "document.pdf", format="pdf")

# Download Google Sheet as CSV
sheet_url = "https://docs.google.com/spreadsheets/d/SHEET_ID/edit"
gdown.download(sheet_url, "data.csv", format="csv")

# Download Google Slides as PowerPoint
slides_url = "https://docs.google.com/presentation/d/PRESENTATION_ID/edit"
gdown.download(slides_url, "presentation.pptx", format="pptx")

Advanced Options

# Resume interrupted download with speed limit
gdown.download(
    url="https://drive.google.com/uc?id=LARGE_FILE_ID",
    output="large_file.zip",
    resume=True,
    speed=1024*1024,  # 1MB/s limit
    proxy="http://proxy.company.com:8080"
)

# Download to stdout (for piping)
import sys
gdown.download(url, output=sys.stdout.buffer, quiet=True)

Custom Progress Messages

# Custom logging messages
custom_messages = {
    "start": "🚀 Starting custom download...\n",
    "output": "📁 Saving to: {}\n"
}

gdown.download(
    url="https://drive.google.com/uc?id=FILE_ID",
    output="file.zip",
    log_messages=custom_messages
)

Error Handling

from gdown.exceptions import FileURLRetrievalError

try:
    gdown.download("https://drive.google.com/uc?id=INVALID_ID", "output.zip")
except FileURLRetrievalError as e:
    print(f"Download failed: {e}")
    # Handle permission issues, network errors, etc.

Supported URL Formats

The download function supports various Google Drive URL formats:

Direct download: https://drive.google.com/uc?id=FILE_ID
File view: https://drive.google.com/file/d/FILE_ID/view?usp=sharing
Document edit: https://docs.google.com/document/d/DOC_ID/edit
Spreadsheet edit: https://docs.google.com/spreadsheets/d/SHEET_ID/edit
Presentation edit: https://docs.google.com/presentation/d/PRESENTATION_ID/edit
Non-Google URLs: Any HTTP/HTTPS URL (functions as alternative to curl/wget)

Configuration Files

Cookie Management

Place cookies in ~/.cache/gdown/cookies.txt (Mozilla/Netscape format) for authenticated downloads:

# Netscape HTTP Cookie File
.google.com	TRUE	/	FALSE	0	cookie_name	cookie_value

User Agent Strings

Default user agents:

File downloads: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36
Custom: Specify via user_agent parameter

Install with Tessl CLI