std/utf8

March 13, 2026 · View on GitHub

The std/utf8 module provides utilities for working with Unicode code points (rune type) and UTF-8 encoding.

Usage

import "std/utf8.zc"

fn main() {
    let r = 'ñ';
    
    if (Utf8::is_alpha(r)) {
        println "{r} is a letter";
    }
}

Method	Signature	Description
is_digit	`is_digit(r: rune) -> bool`	Returns true if the rune is a decimal digit (0-9).
is_alpha	`is_alpha(r: rune) -> bool`	Returns true if the rune is an alphabetic character.
is_whitespace	`is_whitespace(r: rune) -> bool`	Returns true if the rune is a whitespace character.
is_upper	`is_upper(r: rune) -> bool`	Returns true if the rune is an uppercase letter.
is_lower	`is_lower(r: rune) -> bool`	Returns true if the rune is a lowercase letter.
is_valid	`is_valid(data: char*, len: usize) -> bool`	Returns true if the buffer contains valid UTF-8 data.

Method	Signature	Description
to_upper	`to_upper(r: rune) -> rune`	Returns the uppercase version of the rune.
to_lower	`to_lower(r: rune) -> rune`	Returns the lowercase version of the rune.

Method	Signature	Description
encode	`encode(r: rune, buf: char*) -> usize`	Encodes a rune into UTF-8. Returns bytes written (1-4).
rune_len	`rune_len(r: rune) -> usize`	Returns number of bytes required to encode the rune.
decode	`decode(data: char, len: usize, consumed: usize) -> rune`	Decodes the first UTF-8 sequence from data.