Skip to content

textproc

CLI layer implementation for TextProc/Fix.

The following capabilities are provided:

  • TextProc/Fix is a general-purpose data cleaning and normalization operations.

Attributes

Classes

TextProc

TextProc()

Bases: BaseCLI

TextProc CLI class

Functions

fix
fix(
    input_file: str,
    profile: str = "textproc_fix_formatting_generic",
    output_dir: str | None = None,
    additional_user_info: str | None = None,
) -> None

Execute TextProc fix on a document.

Parameters:

  • input_file (str) –

    The path to the input file to be processed.

  • profile (str, default: 'textproc_fix_formatting_generic' ) –

    The profile to use for formatting.

  • output_dir (str | None, default: None ) –

    The directory to save the output file.

  • additional_user_info (str | None, default: None ) –

    Additional information provided by the user.

name classmethod
name() -> str