🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "normalization"

Found 23 matching component(s)

  • function clean_text

    Cleans and normalizes text content by removing HTML tags, normalizing whitespace, and stripping markdown formatting elements.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    text-processing text-cleaning normalization html-removal markdown-removal
  • function extract_warranty_data_improved

    Parses markdown-formatted warranty documentation to extract structured warranty data including IDs, titles, sections, disclosure text, and reference citations.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    markdown-parsing text-extraction warranty-processing document-parsing regex
  • function parse_email_address

    Parses email address strings by handling multiple addresses separated by semicolons and converting them to comma-separated format.

    File: /tf/active/vicechatdev/msg_to_eml.py

    email parsing string-manipulation formatting address-normalization
  • function matches_source_filter

    Checks if a document path matches any of the provided source filters using exact match, folder prefix match, path component sequence match, or filename match.

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    path-matching file-filtering document-filtering path-normalization string-matching
  • class SignatureManager

    A class that manages digital signature images for documents, providing functionality to store, retrieve, and list signature files in a designated directory.

    File: /tf/active/vicechatdev/document_auditor/src/security/signature_manager.py

    signature-management document-processing file-management image-processing digital-signatures
  • function hash_text

    Creates a SHA-256 hash of normalized text content to generate a unique identifier for documents, enabling duplicate detection and content comparison.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    hashing text-processing deduplication content-fingerprinting sha256
  • function clean_html_tags

    Removes HTML tags and entities from text strings, returning clean plain text suitable for PDF display or other formatted output.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    html text-processing sanitization string-manipulation pdf-generation
  • function convert_european_decimals

    Detects and converts numeric data with European decimal format (comma as decimal separator) to standard format (dot as decimal separator) in a pandas DataFrame, handling mixed formats and missing data patterns.

    File: /tf/active/vicechatdev/vice_ai/smartstat_service.py

    data-processing data-cleaning decimal-conversion european-format locale-handling
  • function calculate_cv

    Calculates the coefficient of variation (CV) for a dataset, expressed as a percentage of the standard deviation relative to the mean.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/d48d7789-9627-4e96-9f48-f90b687cd07d/analysis_1.py

    statistics coefficient-of-variation data-analysis variability dispersion
  • function get_document_type_code

    Retrieves a document type code from a dictionary lookup using the provided document type name, returning the name itself if no mapping exists.

    File: /tf/active/vicechatdev/CDocs/settings_prod.py

    lookup mapping document-type code-conversion dictionary-lookup
  • function extract_metadata_from_filecloud

    Extracts and normalizes metadata from FileCloud for document creation, providing default values and generating document numbers when needed.

    File: /tf/active/vicechatdev/CDocs/FC_sync.py

    metadata normalization document-management filecloud data-extraction
  • function get_documents

    Retrieves controlled documents from a Neo4j database with comprehensive filtering, permission-based access control, pagination, and full-text search capabilities.

    File: /tf/active/vicechatdev/CDocs/controllers/document_controller.py

    document-management neo4j graph-database filtering pagination
  • function normalize_uid

    Normalizes a UID value by converting it to a string, or generates a new UUID v4 if the input is empty or None.

    File: /tf/active/vicechatdev/CDocs/utils/uid_helper.py

    uid uuid normalization database identifier
  • function _process_event_record

    Processes a single event record from a database by normalizing timestamps, adding user information, and parsing JSON details fields.

    File: /tf/active/vicechatdev/CDocs/utils/audit_trail.py

    data-processing event-processing database timestamp-normalization json-parsing
  • function _origin_url

    Removes the protocol scheme (http/https) from a URL string, returning only the domain and path portion.

    File: /tf/active/vicechatdev/patches/server.py

    url-processing string-manipulation protocol-removal url-normalization web-utilities
  • function process_ellipses

    Expands an Ellipsis (...) in a __getitem__ key by replacing it with the appropriate number of empty slices (slice(None)) to match the dimensions of an object.

    File: /tf/active/vicechatdev/patches/util.py

    indexing ellipsis slicing multi-dimensional data-structures
  • class sanitize_identifier_fn

    A parameterized function class that sanitizes strings (group/label values) to make them safe for use as Python attribute names in AttrTree structures by converting special characters to their unicode names and applying transformations.

    File: /tf/active/vicechatdev/patches/util.py

    sanitization identifier unicode string-processing attribute-names
  • function wrap_tuple

    A utility function that ensures the input is wrapped in a tuple, leaving existing tuples unchanged and wrapping non-tuple values in a single-element tuple.

    File: /tf/active/vicechatdev/patches/util.py

    utility tuple type-conversion normalization wrapper
  • function get_path

    Extracts and sanitizes a hierarchical path from a Labelled object or a tuple containing an existing path and a Labelled object, returning a tuple of capitalized, sanitized path components.

    File: /tf/active/vicechatdev/patches/util.py

    path-construction sanitization hierarchical-data labelled-objects string-processing
  • function parse_datetime_selection

    Converts string or datetime-like selection specifications into parsed datetime objects, handling single values, slices, and collections.

    File: /tf/active/vicechatdev/patches/util.py

    datetime parsing data-selection time-series normalization
  • class PatentClient

    A client class for searching and retrieving patent data from multiple sources including USPTO, EPO (European Patent Office), and The Lens API.

    File: /tf/active/vicechatdev/QA_updater/data_access/patent_client.py

    patent-search api-client uspto epo lens
  • class CompanyNewsClient

    A client class for accessing company news and information from multiple sources including GDELT Project and NewsAPI, with built-in rate limiting and error handling.

    File: /tf/active/vicechatdev/QA_updater/data_access/company_news_client.py

    news-api gdelt news-aggregation company-news rate-limiting
  • class FormatNormalizer

    Normalizes extracted data formats to ensure consistency. Handles: - Date format standardization - Number/currency normalization - VAT/tax number formatting - Field name standardization - Address formatting - Field value cleaning

    File: /tf/active/vicechatdev/invoice_extraction/utils/format_normalizer.py

    class formatnormalizer

Search Examples