Functions

November 14, 2025

By default, functions are installed into the public schema. You can choose an alternate location by running CREATE EXTENSION pg_duckdb WITH SCHEMA your_schema_name.

Note: ALTER EXTENSION is not currently supported for moving the extension to a different schema.
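
For example, to install the extension into a dedicated schema (a sketch; the schema name ducks is just an illustration):

```sql
CREATE SCHEMA ducks;
CREATE EXTENSION pg_duckdb WITH SCHEMA ducks;

-- The extension's functions are then referenced with the schema qualifier:
SELECT * FROM ducks.read_parquet('file.parquet');
```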

Data Lake Functions

| Name | Description |
| --- | --- |
| read_parquet | Read a Parquet file |
| read_csv | Read a CSV file |
| read_json | Read a JSON file |
| iceberg_scan | Read an Iceberg dataset |
| iceberg_metadata | Read Iceberg metadata |
| iceberg_snapshots | Read Iceberg snapshot information |
| delta_scan | Read a Delta dataset |

JSON Functions

All of the DuckDB JSON functions and aggregates. Postgres JSON/JSONB functions are not supported.
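
For example, DuckDB's json_extract function can be applied to JSON values read from a file (a sketch; the file name, column name, and JSON path are illustrative):

```sql
SELECT json_extract(r['payload'], '$.user.id') AS user_id
FROM read_json('events.json') r;
```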

Union Type Functions

| Name | Description |
| --- | --- |
| union_extract | Extracts a value from a union type by tag name. |
| union_tag | Gets the tag name of the active member in a union type. |

MAP Functions

All of the DuckDB map functions.

| Name | Description |
| --- | --- |
| cardinality | Return the size of the map |
| element_at | Return the value for a given key as a list |
| map_concat | Merge multiple maps |
| map_contains | Check if a map contains a given key |
| map_contains_entry | Check if a map contains a given key-value pair |
| map_contains_value | Check if a map contains a given value |
| map_entries | Return a list of struct(k, v) for each key-value pair |
| map_extract | Extract a value from a map using a key |
| map_extract_value | Return the value for a given key or NULL |
| map_from_entries | Create a map from an array of struct(k, v) |
| map_keys | Get all keys from a map as a list |
| map_values | Get all values from a map as a list |

Aggregates

| Name | Description |
| --- | --- |
| approx_count_distinct | Approximates the count of distinct elements using HyperLogLog. |

Sampling Functions

| Name | Description |
| --- | --- |
| TABLESAMPLE | Samples a subset of rows from a table or query result. |

Time Functions

| Name | Description |
| --- | --- |
| time_bucket | Buckets timestamps into time intervals for time-series analysis. |
| strftime | Formats timestamps as strings using format codes. |
| strptime | Parses strings into timestamps using format codes. |
| epoch | Converts timestamps to Unix epoch seconds. |
| epoch_ms | Converts timestamps to Unix epoch milliseconds. |
| epoch_us | Converts timestamps to Unix epoch microseconds. |
| epoch_ns | Converts timestamps to Unix epoch nanoseconds. |
| make_timestamp | Creates a timestamp from microseconds since epoch. |
| make_timestamptz | Creates a timestamp with timezone from microseconds since epoch. |

DuckDB Administration Functions

| Name | Description |
| --- | --- |
| duckdb.install_extension | Installs a DuckDB extension. |
| duckdb.load_extension | Loads a DuckDB extension for the current session. |
| duckdb.autoload_extension | Configures whether an extension should be auto-loaded. |
| duckdb.query | Runs a SELECT query directly against DuckDB. |
| duckdb.raw_query | Runs any query directly against DuckDB (for debugging). |
| duckdb.recycle_ddb | Resets the DuckDB instance in the current connection (for debugging). |

Secrets Management Functions

| Name | Description |
| --- | --- |
| duckdb.create_simple_secret | Creates a simple secret for cloud storage access. |
| duckdb.create_azure_secret | Creates an Azure secret using a connection string. |

Motherduck Functions

| Name | Description |
| --- | --- |
| duckdb.enable_motherduck | Enables MotherDuck integration with a token. |
| duckdb.is_motherduck_enabled | Checks if MotherDuck integration is enabled. |
| duckdb.force_motherduck_sync | Forces a full resync of MotherDuck databases and schemas to Postgres (for debugging). |

Detailed Descriptions

read_parquet(path TEXT or TEXT[], ...) -> SETOF duckdb.row

Reads a Parquet file, either from a remote location (via httpfs) or a local file.

This returns DuckDB rows; you can expand them using * or select specific columns using the r['mycol'] syntax. If you select specific columns, you should give the function call a short alias, such as r. For example:

SELECT * FROM read_parquet('file.parquet');
SELECT r['id'], r['name'] FROM read_parquet('file.parquet') r WHERE r['age'] > 21;
SELECT COUNT(*) FROM read_parquet('file.parquet');

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text or text[] | The path, either to a remote httpfs file or a local file (if enabled), of the parquet file(s) to read. The path can be a glob or array of files to read. |
Optional Parameters

Optional parameters mirror DuckDB's read_parquet function. To specify optional parameters, use parameter := 'value'.
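
For example, DuckDB's hive_partitioning parameter can be enabled this way (a sketch; the S3 path is illustrative):

```sql
SELECT * FROM read_parquet('s3://my-bucket/data/*/*.parquet', hive_partitioning := true);
```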

read_csv(path TEXT or TEXT[], ...) -> SETOF duckdb.row

Reads a CSV file, either from a remote location (via httpfs) or a local file.

This returns DuckDB rows; you can expand them using * or select specific columns using the r['mycol'] syntax. If you select specific columns, you should give the function call a short alias, such as r. For example:

SELECT * FROM read_csv('file.csv');
SELECT r['id'], r['name'] FROM read_csv('file.csv') r WHERE r['age'] > 21;
SELECT COUNT(*) FROM read_csv('file.csv');

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text or text[] | The path, either to a remote httpfs file or a local file (if enabled), of the CSV file(s) to read. The path can be a glob or array of files to read. |
Optional Parameters

Optional parameters mirror DuckDB's read_csv function. To specify optional parameters, use parameter := 'value'.

Compatibility notes:

  • columns is not currently supported.
  • nullstr must be an array (TEXT[]).

read_json(path TEXT or TEXT[], ...) -> SETOF duckdb.row

Reads a JSON file, either from a remote location (via httpfs) or a local file.

This returns DuckDB rows; you can expand them using * or select specific columns using the r['mycol'] syntax. If you select specific columns, you should give the function call a short alias, such as r. For example:

SELECT * FROM read_json('file.json');
SELECT r['id'], r['name'] FROM read_json('file.json') r WHERE r['age'] > 21;
SELECT COUNT(*) FROM read_json('file.json');

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text or text[] | The path, either to a remote httpfs file or a local file (if enabled), of the JSON file(s) to read. The path can be a glob or array of files to read. |
Optional Parameters

Optional parameters mirror DuckDB's read_json function. To specify optional parameters, use parameter := 'value'.

Compatibility notes:

  • columns is not currently supported.

iceberg_scan(path TEXT, ...) -> SETOF duckdb.row

Reads an Iceberg table, either from a remote location (via httpfs) or a local directory.

To use iceberg_scan, you must enable the iceberg extension:

SELECT duckdb.install_extension('iceberg');

This returns DuckDB rows; you can expand them using * or select specific columns using the r['mycol'] syntax. If you select specific columns, you should give the function call a short alias, such as r. For example:

SELECT * FROM iceberg_scan('data/iceberg/table');
SELECT r['id'], r['name'] FROM iceberg_scan('data/iceberg/table') r WHERE r['age'] > 21;
SELECT COUNT(*) FROM iceberg_scan('data/iceberg/table');

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text | The path, either to a remote httpfs location or a local location (if enabled), of the Iceberg table to read. |
Optional Arguments

Optional parameters mirror DuckDB's iceberg_scan function based on the DuckDB source code. However, documentation on these parameters is limited. To specify optional parameters, use parameter := 'value'.

| Name | Type | Default | Description |
| --- | --- | --- | --- |
| allowed_moved_paths | boolean | false | Ensures that some path resolution is performed, which allows scanning Iceberg tables that are moved. |
| mode | text | '' | |
| metadata_compression_codec | text | 'none' | |
| skip_schema_inference | boolean | false | |
| version | text | 'version-hint.text' | |
| version_name_format | text | 'v%s%s.metadata.json,%s%s.metadata.json' | |

iceberg_metadata(path TEXT, ...) -> SETOF iceberg_metadata_record

To use iceberg_metadata, you must enable the iceberg extension:

SELECT duckdb.install_extension('iceberg');

Returns metadata about an Iceberg table. Data is returned as a set of iceberg_metadata_record, which is defined as:

CREATE TYPE duckdb.iceberg_metadata_record AS (
  manifest_path TEXT,
  manifest_sequence_number NUMERIC,
  manifest_content TEXT,
  status TEXT,
  content TEXT,
  file_path TEXT
);

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text | The path, either to a remote httpfs location or a local location (if enabled), of the Iceberg table to read. |
Optional Arguments

Optional parameters mirror DuckDB's iceberg_metadata function based on the DuckDB source code. However, documentation on these parameters is limited. To specify optional parameters, use parameter := 'value'.

| Name | Type | Default | Description |
| --- | --- | --- | --- |
| allowed_moved_paths | boolean | false | Ensures that some path resolution is performed, which allows scanning Iceberg tables that are moved. |
| metadata_compression_codec | text | 'none' | |
| skip_schema_inference | boolean | false | |
| version | text | 'version-hint.text' | |
| version_name_format | text | 'v%s%s.metadata.json,%s%s.metadata.json' | |

iceberg_snapshots(path TEXT, ...) -> SETOF iceberg_snapshot_record

Reads Iceberg snapshot information from an Iceberg table.

To use iceberg_snapshots, you must enable the iceberg extension:

SELECT duckdb.install_extension('iceberg');

This function returns snapshot metadata for an Iceberg table, which can be useful for time travel queries and understanding table history.

SELECT * FROM iceberg_snapshots('data/iceberg/table');

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text | The path, either to a remote httpfs location or a local location (if enabled), of the Iceberg table to read. |
Optional Arguments

Optional parameters mirror DuckDB's iceberg_snapshots function. To specify optional parameters, use parameter := 'value'.

| Name | Type | Default | Description |
| --- | --- | --- | --- |
| metadata_compression_codec | text | 'none' | |
| skip_schema_inference | boolean | false | |
| version | text | 'version-hint.text' | |
| version_name_format | text | 'v%s%s.metadata.json,%s%s.metadata.json' | |

delta_scan(path TEXT) -> SETOF duckdb.row

Reads a Delta dataset, either from a remote location (via httpfs) or a local one.

To use delta_scan, you must enable the delta extension:

SELECT duckdb.install_extension('delta');

This returns DuckDB rows; you can expand them using * or select specific columns using the r['mycol'] syntax. If you select specific columns, you should give the function call a short alias, such as r. For example:

SELECT * FROM delta_scan('/path/to/delta/dataset');
SELECT r['id'], r['name'] FROM delta_scan('/path/to/delta/dataset') r WHERE r['age'] > 21;
SELECT COUNT(*) FROM delta_scan('/path/to/delta/dataset');

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| path | text | The path, either to a remote httpfs location or a local location (if enabled), of the Delta dataset to read. |

duckdb.install_extension(extension_name TEXT, repository TEXT DEFAULT 'core') -> bool

Installs a DuckDB extension and configures it to be loaded automatically in every session that uses pg_duckdb.

SELECT duckdb.install_extension('iceberg');
SELECT duckdb.install_extension('avro', 'community');
Security

Since this function can be used to download and install any extension, by default it can only be executed by a superuser. To allow execution by another admin user, such as my_admin, you can grant that user the following permissions:

GRANT ALL ON FUNCTION duckdb.install_extension(TEXT, TEXT) TO my_admin;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| extension_name | text | The name of the extension to install |

duckdb.load_extension(extension_name TEXT) -> void

Loads a DuckDB extension for the current session only. Unlike install_extension, this doesn't configure the extension to be loaded automatically in future sessions.

SELECT duckdb.load_extension('iceberg');
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| extension_name | text | The name of the extension to load |

duckdb.autoload_extension(extension_name TEXT, autoload BOOLEAN) -> void

Configures whether an installed extension should be automatically loaded in new sessions.

-- Disable auto-loading for an extension
SELECT duckdb.autoload_extension('iceberg', false);

-- Enable auto-loading for an extension
SELECT duckdb.autoload_extension('iceberg', true);
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| extension_name | text | The name of the extension to configure |
| autoload | boolean | Whether the extension should be auto-loaded |

duckdb.query(query TEXT) -> SETOF duckdb.row

Executes the given SELECT query directly against DuckDB. This can be useful if DuckDB syntax makes the query easier to write, or if you want to use a function that pg_duckdb does not expose yet. If you use it because of a missing function in pg_duckdb, please also open an issue on the GitHub repository so that we can add support. For example, the query below puts FROM before SELECT and uses a list comprehension, neither of which is supported in Postgres.

SELECT * FROM duckdb.query('FROM range(10) as a(a) SELECT [a for i in generate_series(0, a)] as arr');

duckdb.raw_query(query TEXT) -> void

Runs an arbitrary query directly against DuckDB. Compared to duckdb.query, this function can execute any query, not just SELECT queries. The main downside is that it doesn't return its result as rows; instead it sends the query result to the logs. So the recommendation is to use duckdb.query when possible, but if you need to run e.g. some DDL you can use this function.
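
For example, to run a DDL statement against the underlying DuckDB instance (a sketch; the table name is illustrative):

```sql
SELECT duckdb.raw_query('CREATE TABLE my_duckdb_table (id INT, name TEXT)');
```

The statement's result is written to the Postgres logs rather than returned as a result set.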

duckdb.recycle_ddb() -> void

pg_duckdb keeps the DuckDB instance open between transactions. This is done to preserve session-level state, such as manually executed SET commands. If you want to clear this session-level state for some reason, you can close the currently open DuckDB instance using:

CALL duckdb.recycle_ddb();

duckdb.enable_motherduck(token TEXT, database_name TEXT) -> void

Enables MotherDuck integration with the provided authentication token.

-- Enable MotherDuck with default database
SELECT duckdb.enable_motherduck('your_token_here');

-- Enable MotherDuck with specific database
SELECT duckdb.enable_motherduck('your_token_here', 'my_database');
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| token | text | Your MotherDuck authentication token |
Optional Arguments
| Name | Type | Description |
| --- | --- | --- |
| database_name | text | Specific MotherDuck database to connect to |

duckdb.is_motherduck_enabled() -> boolean

Checks whether MotherDuck integration is currently enabled for this session.

SELECT duckdb.is_motherduck_enabled();

duckdb.create_simple_secret(type TEXT, key_id TEXT, secret TEXT, region TEXT, ...) -> void

Creates a simple secret for accessing cloud storage services like S3, GCS, or R2.

-- Create an S3 secret
SELECT duckdb.create_simple_secret(
    type := 'S3',
    key_id := 'your_access_key',
    secret := 'your_secret_key',
    region := 'us-east-1'
);

-- Create an S3 secret with session token
SELECT duckdb.create_simple_secret(
    type := 'S3',
    key_id := 'your_access_key',
    secret := 'your_secret_key',
    region := 'us-east-1',
    session_token := 'your_session_token'
);
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| type | text | The type of secret ('S3', 'GCS', 'R2', etc.) |
| key_id | text | The access key ID or equivalent |
| secret | text | The secret key or equivalent |
| region | text | The region for the service |
Optional Arguments
| Name | Type | Description |
| --- | --- | --- |
| session_token | text | Session token for temporary credentials |
| endpoint | text | Custom endpoint URL |
| url_style | text | URL style ('vhost' or 'path') |
| use_ssl | text | Whether to use SSL ('true' or 'false') |
| scope | text | Scope for the secret (default: '') |

duckdb.create_azure_secret(connection_string TEXT, scope TEXT DEFAULT '') -> TEXT

Creates an Azure secret using an Azure Blob Storage connection string.

-- Create an Azure secret
SELECT duckdb.create_azure_secret(
    'DefaultEndpointsProtocol=https;AccountName=myaccount;AccountKey=mykey;EndpointSuffix=core.windows.net'
);

-- Create an Azure secret with specific scope
SELECT duckdb.create_azure_secret(
    'DefaultEndpointsProtocol=https;AccountName=myaccount;AccountKey=mykey;EndpointSuffix=core.windows.net',
    'my_scope'
);
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| connection_string | text | The Azure Blob Storage connection string |
Optional Arguments
| Name | Type | Description |
| --- | --- | --- |
| scope | text | Scope for the secret (default: '') |

duckdb.force_motherduck_sync(drop_with_cascade BOOLEAN DEFAULT false)

Warning: There are known issues with this function. To re-trigger a sync, it is recommended to use the following command instead:

SELECT * FROM pg_terminate_backend((
  SELECT pid FROM pg_stat_activity WHERE backend_type = 'pg_duckdb sync worker'
));

pg_duckdb will normally automatically synchronize your MotherDuck tables with Postgres using a Postgres background worker. Sometimes this synchronization fails. This can happen for various reasons, but often this is due to permission issues or users having created dependencies on MotherDuck tables that need to be updated. In those cases this function can be helpful for a few reasons:

  1. To show the ERRORs that happen during syncing
  2. To re-trigger a sync after fixing the issue
  3. To drop the MotherDuck tables with CASCADE, which also drops all objects that depend on them

For the first two usages you can simply call this procedure as follows:

CALL duckdb.force_motherduck_sync();

But for the third usage you need to pass it the drop_with_cascade parameter:

CALL duckdb.force_motherduck_sync(drop_with_cascade := true);

NOTE: Dropping with cascade will drop all objects that depend on the MotherDuck tables. This includes all views, functions, and tables that depend on the MotherDuck tables. This can be a destructive operation, so use with caution.

time_bucket(bucket_width INTERVAL, timestamp_col TIMESTAMP, origin TIMESTAMP) -> TIMESTAMP

Buckets timestamps into time intervals for time-series analysis. This function is compatible with TimescaleDB's time_bucket function, allowing for easier migration and interoperability.

-- Group events by hour
SELECT time_bucket(INTERVAL '1 hour', created_at) as hour_bucket, COUNT(*)
FROM events
GROUP BY hour_bucket
ORDER BY hour_bucket;

-- Group by 15-minute intervals
SELECT time_bucket(INTERVAL '15 minutes', timestamp_col), AVG(value)
FROM sensor_data
WHERE timestamp_col >= '2024-01-01'
GROUP BY 1
ORDER BY 1;

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| bucket_width | interval | The interval size for bucketing (e.g., '1 hour', '15 minutes') |
| timestamp_col | timestamp | The timestamp column to bucket |
Optional Arguments
| Name | Type | Description |
| --- | --- | --- |
| origin | timestamp | The origin point for bucketing. Buckets are aligned to this timestamp. |

Note: The time_bucket function also supports timezone and time offset parameters for more advanced time bucketing scenarios.

strftime(timestamp_expr, format_string) -> TEXT

Formats timestamps as strings using standard format codes. This function provides flexible timestamp formatting for display and export purposes.

-- Format current timestamp
SELECT strftime(NOW(), '%Y-%m-%d %H:%M:%S') AS formatted_time;

-- Format timestamps in different formats
SELECT
    order_id,
    strftime(created_at, '%Y-%m-%d') AS order_date,
    strftime(created_at, '%H:%M') AS order_time,
    strftime(created_at, '%A, %B %d, %Y') AS readable_date
FROM orders;

-- Use for partitioning file exports
COPY (SELECT * FROM events WHERE event_date = '2024-01-01')
TO 's3://bucket/events/' || strftime('2024-01-01'::timestamp, '%Y/%m/%d') || '/events.parquet';

Common format codes:

  • %Y - 4-digit year (2024)
  • %m - Month as number (01-12)
  • %d - Day of month (01-31)
  • %H - Hour (00-23)
  • %M - Minute (00-59)
  • %S - Second (00-59)
  • %A - Full weekday name (Monday)
  • %B - Full month name (January)

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| timestamp_expr | timestamp | The timestamp value to format |
| format_string | text | The format string with format codes |

strptime(string_expr, format_string) -> TIMESTAMP

Parses strings into timestamps using format codes. This is the inverse of strftime and is useful for parsing timestamps from various string formats.

-- Parse date strings
SELECT strptime('2024-01-15 14:30:00', '%Y-%m-%d %H:%M:%S') AS parsed_timestamp;

-- Parse different formats
SELECT
    strptime('Jan 15, 2024', '%b %d, %Y') AS date1,
    strptime('15/01/2024', '%d/%m/%Y') AS date2,
    strptime('2024-01-15T14:30:00Z', '%Y-%m-%dT%H:%M:%SZ') AS iso_date;

-- Parse log timestamps
SELECT
    log_id,
    strptime(timestamp_string, '%Y-%m-%d %H:%M:%S') AS parsed_time,
    message
FROM raw_logs;

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| string_expr | text | The string to parse as a timestamp |
| format_string | text | The format string describing the input format |

epoch(timestamp_expr) -> BIGINT

Converts timestamps to Unix epoch seconds (seconds since 1970-01-01 00:00:00 UTC).

-- Get current epoch time
SELECT epoch(NOW()) AS current_epoch;

-- Convert timestamps for API usage
SELECT
    event_id,
    epoch(event_timestamp) AS epoch_seconds
FROM events;

-- Filter using epoch time
SELECT * FROM events
WHERE epoch(created_at) > 1640995200; -- After 2022-01-01
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| timestamp_expr | timestamp | The timestamp to convert to epoch seconds |

map_extract(map_col duckdb.map, key duckdb.unresolved_type) -> duckdb.unresolved_type

Extracts a value from a map using the specified key. If the key doesn't exist, returns an empty array.

-- Extract value from a map
SELECT map_extract(r['map_col'], 'a') as value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: {1}

-- Extract non-existent key
SELECT map_extract(r['map_col'], 'c') as value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: {}
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to extract from |
| key | duckdb.unresolved_type | The key to look up in the map |

map_keys(map_col duckdb.map) -> duckdb.unresolved_type

Returns all keys from a map as an array.

-- Get all keys from a map
SELECT map_keys(r['map_col']) as keys 
FROM duckdb.query($$ SELECT MAP(['a', 'b', 'c'], [1, 2, 3]) as map_col $$) r;
-- Returns: {a,b,c}

-- Empty map
SELECT map_keys(r['map_col']) as keys 
FROM duckdb.query($$ SELECT MAP([], []) as map_col $$) r;
-- Returns: {}
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to extract keys from |

map_values(map_col duckdb.map) -> duckdb.unresolved_type

Returns all values from a map as an array.

-- Get all values from a map
SELECT map_values(r['map_col']) as values 
FROM duckdb.query($$ SELECT MAP(['a', 'b', 'c'], [1, 2, 3]) as map_col $$) r;
-- Returns: {1,2,3}

-- Empty map
SELECT map_values(r['map_col']) as values 
FROM duckdb.query($$ SELECT MAP([], []) as map_col $$) r;
-- Returns: {}
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to extract values from |

cardinality(map_col duckdb.map) -> numeric

Returns the size of the map (number of key-value pairs).

-- Get the number of entries in a map
SELECT cardinality(r['map_col']) as size 
FROM duckdb.query($$ SELECT MAP(['a', 'b', 'c'], [1, 2, 3]) as map_col $$) r;
-- Returns: 3

-- Empty map
SELECT cardinality(r['map_col']) as size 
FROM duckdb.query($$ SELECT MAP([], []) as map_col $$) r;
-- Returns: 0
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to get the size of |

element_at(map_col duckdb.map, key duckdb.unresolved_type) -> duckdb.unresolved_type

Returns the value for a given key as an array.

-- Get value for a specific key
SELECT element_at(r['map_col'], 'a') as value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: {1}

-- Non-existent key
SELECT element_at(r['map_col'], 'c') as value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: {}
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to extract from |
| key | duckdb.unresolved_type | The key to look up in the map |

map_concat(map_col duckdb.map, map_col2 duckdb.map) -> duckdb.map

Merges multiple maps. On key collision, the value is taken from the last map.

-- Merge two maps
SELECT map_concat(r1['map1'], r2['map2']) as merged 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map1 $$) r1, 
     duckdb.query($$ SELECT MAP(['b', 'c'], [3, 4]) as map2 $$) r2;
-- Returns: {a=1, b=3, c=4}

-- Note: 'b' value from map2 (3) overwrites map1's value (2)
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The first map |
| map_col2 | duckdb.map | The second map to merge |

map_contains(map_col duckdb.map, key duckdb.unresolved_type) -> boolean

Checks if a map contains a given key.

-- Check if key exists
SELECT map_contains(r['map_col'], 'a') as has_key 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: t (true)

-- Check for non-existent key
SELECT map_contains(r['map_col'], 'c') as has_key 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: f (false)
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to check |
| key | duckdb.unresolved_type | The key to search for |

map_contains_entry(map_col duckdb.map, key duckdb.unresolved_type, value duckdb.unresolved_type) -> boolean

Checks if a map contains a given key-value pair.

-- Check if key-value pair exists
SELECT map_contains_entry(r['map_col'], 'a', 1) as has_entry 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: t (true)

-- Check with wrong value for existing key
SELECT map_contains_entry(r['map_col'], 'a', 2) as has_entry 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: f (false)
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to check |
| key | duckdb.unresolved_type | The key to search for |
| value | duckdb.unresolved_type | The value to match with the key |

map_contains_value(map_col duckdb.map, value duckdb.unresolved_type) -> boolean

Checks if a map contains a given value.

-- Check if value exists
SELECT map_contains_value(r['map_col'], 1) as has_value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: t (true)

-- Check for non-existent value
SELECT map_contains_value(r['map_col'], 3) as has_value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: f (false)
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to check |
| value | duckdb.unresolved_type | The value to search for |

map_entries(map_col duckdb.map) -> duckdb.struct[]

Returns an array of struct(key, value) for each key-value pair in the map.

-- Get all key-value pairs as structs
SELECT map_entries(r['map_col']) as entries 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: {"(a,1)","(b,2)"}

-- Access individual struct fields
SELECT unnest(map_entries(r['map_col'])) as entry 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to extract entries from |

map_extract_value(map_col duckdb.map, key duckdb.unresolved_type) -> duckdb.unresolved_type

Returns the value for a given key or NULL if the key is not contained in the map.

-- Extract single value (not as array)
SELECT map_extract_value(r['map_col'], 'a') as value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: 1

-- Non-existent key returns NULL
SELECT map_extract_value(r['map_col'], 'c') as value 
FROM duckdb.query($$ SELECT MAP(['a', 'b'], [1, 2]) as map_col $$) r;
-- Returns: NULL
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| map_col | duckdb.map | The map to extract from |
| key | duckdb.unresolved_type | The key to look up in the map |

map_from_entries(entries duckdb.struct[]) -> duckdb.map

Creates a map from an array of struct(k, v).

-- Create map from array of structs
SELECT map_from_entries(r['entries']) as new_map 
FROM duckdb.query($$ 
    SELECT [{'k': 'a', 'v': 1}, {'k': 'b', 'v': 2}] as entries 
$$) r;
-- Returns: {a=1, b=2}

-- This is the inverse operation of map_entries
SELECT map_from_entries(map_entries(r['map_col'])) as reconstructed 
FROM duckdb.query($$ SELECT MAP(['x', 'y'], [10, 20]) as map_col $$) r;
-- Returns: {x=10, y=20}
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| entries | duckdb.struct[] | Array of structs with 'k' (key) and 'v' (value) fields |

epoch_ms(timestamp_expr) -> BIGINT

Converts timestamps to Unix epoch milliseconds.

-- High-precision timestamp for JavaScript
SELECT epoch_ms(NOW()) AS timestamp_ms;

-- For time-series data
SELECT
    sensor_id,
    epoch_ms(reading_time) AS timestamp_ms,
    value
FROM sensor_readings;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| timestamp_expr | timestamp | The timestamp to convert to epoch milliseconds |

epoch_ms(milliseconds) -> TIMESTAMP

Converts Unix epoch milliseconds to a timestamp. This is the inverse of the above function.

-- Convert epoch milliseconds to timestamp
SELECT epoch_ms(1640995200000) AS timestamp_from_ms; -- 2022-01-01 00:00:00

-- Convert stored milliseconds back to timestamps
SELECT
    event_id,
    epoch_ms(timestamp_ms) AS event_time
FROM events;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| milliseconds | bigint | Milliseconds since Unix epoch |

epoch_us(timestamp_expr) -> BIGINT

Converts timestamps to Unix epoch microseconds.

-- Microsecond precision timestamps
SELECT epoch_us(NOW()) AS timestamp_us;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| timestamp_expr | timestamp | The timestamp to convert to epoch microseconds |

epoch_ns(timestamp_expr) -> BIGINT

Converts timestamps to Unix epoch nanoseconds.

-- Nanosecond precision timestamps
SELECT epoch_ns(NOW()) AS timestamp_ns;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| timestamp_expr | timestamp | The timestamp to convert to epoch nanoseconds |

make_timestamp(microseconds) -> TIMESTAMP

Creates a timestamp from microseconds since Unix epoch (1970-01-01 00:00:00 UTC).

-- Create timestamp from current epoch microseconds
SELECT make_timestamp(epoch_us(NOW())) AS reconstructed_timestamp;

-- Create specific timestamps
SELECT make_timestamp(1640995200000000) AS new_years_2022; -- 2022-01-01 00:00:00
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| microseconds | bigint | Microseconds since Unix epoch |

make_timestamptz(microseconds) -> TIMESTAMPTZ

Creates a timestamp with timezone from microseconds since Unix epoch.

-- Create timestamptz from current epoch microseconds
SELECT make_timestamptz(epoch_us(NOW())) AS reconstructed_timestamptz;

-- Create specific timestamptz
SELECT make_timestamptz(1640995200000000) AS new_years_2022_tz;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| microseconds | bigint | Microseconds since Unix epoch |

TABLESAMPLE (sampling_method(percentage | rows))

Samples a subset of rows from a table or query result. This is useful for analyzing large datasets by working with representative samples, improving query performance for exploratory data analysis.

-- Sample 10% of rows from a table
SELECT * FROM large_table TABLESAMPLE SYSTEM(10);

-- Sample approximately 1000 rows
SELECT * FROM events TABLESAMPLE SYSTEM(1000 ROWS);

-- Sample from data lake files
SELECT * FROM read_parquet('s3://datalake/**/*.parquet') TABLESAMPLE SYSTEM(5);

-- Use sampling for quick data profiling
SELECT
    region,
    COUNT(*) as sample_count,
    AVG(revenue) as avg_revenue
FROM sales_data TABLESAMPLE SYSTEM(2)
GROUP BY region;

-- Sample from joins for performance
SELECT c.name, COUNT(o.id) as order_count
FROM customers c
JOIN orders o TABLESAMPLE SYSTEM(10) ON c.id = o.customer_id
GROUP BY c.name;

Sampling Methods:

  • SYSTEM: Random sampling at the storage level (faster, approximate percentage)
  • BERNOULLI: Row-by-row random sampling (slower, exact percentage)

-- System sampling (recommended for large tables)
SELECT * FROM huge_table TABLESAMPLE SYSTEM(1);

-- Bernoulli sampling (exact percentage)
SELECT * FROM medium_table TABLESAMPLE BERNOULLI(5);

Use Cases:

  • Data exploration: Quick analysis of large datasets
  • Performance testing: Test queries on sample data
  • Data profiling: Understand data distribution patterns
  • ETL development: Develop pipelines on sample data
  • Quality checks: Validate data quality on samples

Further information:

Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| sampling_method | keyword | Either SYSTEM or BERNOULLI |
| percentage | numeric | Percentage of rows to sample (0-100) |
Optional Arguments
| Name | Type | Description |
| --- | --- | --- |
| rows | integer | Approximate number of rows to sample (use with ROWS keyword) |

union_extract(union_col, tag) -> duckdb.unresolved_type

Extracts a value from a union type by specifying the tag name of the member you want to access.

-- Extract the string value if the union contains a string
SELECT union_extract(my_union_column, 'string') FROM my_table;

-- Extract integer value from union
SELECT union_extract(data_field, 'integer') AS extracted_int FROM mixed_data;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| union_col | duckdb.union or duckdb.unresolved_type | The union column to extract from |
| tag | text | The tag name of the union member to extract |

union_tag(union_col) -> duckdb.unresolved_type

Returns the tag name of the currently active member in a union type.

-- Get the active tag for each row
SELECT union_tag(my_union_column) AS active_type FROM my_table;

-- Filter rows based on union tag
SELECT * FROM my_table WHERE union_tag(data_field) = 'string';
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| union_col | duckdb.union or duckdb.unresolved_type | The union column to get the tag from |

approx_count_distinct(expression) -> BIGINT

Approximates the count of distinct elements using the HyperLogLog algorithm. This is much faster than COUNT(DISTINCT ...) for large datasets, with a small error rate.

-- Approximate distinct count of customer IDs
SELECT approx_count_distinct(customer_id) FROM orders;

-- Compare with exact count
SELECT
    approx_count_distinct(customer_id) AS approx_distinct,
    COUNT(DISTINCT customer_id) AS exact_distinct
FROM orders;
Required Arguments
| Name | Type | Description |
| --- | --- | --- |
| expression | any | The expression to count distinct values for |