Search - Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "normalization"

Found 26 matching component(s)

function clean_text

Cleans and normalizes text content by removing HTML tags, normalizing whitespace, and stripping markdown formatting elements.

File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

text-processing text-cleaning normalization html-removal markdown-removal
function extract_warranty_data_improved

Parses markdown-formatted warranty documentation to extract structured warranty data including IDs, titles, sections, disclosure text, and reference citations.

File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

markdown-parsing text-extraction warranty-processing document-parsing regex
function parse_email_address

Parses email address strings by handling multiple addresses separated by semicolons and converting them to comma-separated format.

File: /tf/active/vicechatdev/msg_to_eml.py

email parsing string-manipulation formatting address-normalization
function matches_source_filter

Checks if a document path matches any of the provided source filters using exact match, folder prefix match, path component sequence match, or filename match.

File: /tf/active/vicechatdev/docchat/rag_engine.py

path-matching file-filtering document-filtering path-normalization string-matching
class SignatureManager

A class that manages digital signature images for documents, providing functionality to store, retrieve, and list signature files in a designated directory.

File: /tf/active/vicechatdev/document_auditor/src/security/signature_manager.py

signature-management document-processing file-management image-processing digital-signatures
function hash_text

Creates a SHA-256 hash of normalized text content to generate a unique identifier for documents, enabling duplicate detection and content comparison.

File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

hashing text-processing deduplication content-fingerprinting sha256
function clean_html_tags

Removes HTML tags and entities from text strings, returning clean plain text suitable for PDF display or other formatted output.

File: /tf/active/vicechatdev/vice_ai/complex_app.py

html text-processing sanitization string-manipulation pdf-generation
function convert_european_decimals

Detects and converts numeric data with European decimal format (comma as decimal separator) to standard format (dot as decimal separator) in a pandas DataFrame, handling mixed formats and missing data patterns.

File: /tf/active/vicechatdev/vice_ai/smartstat_service.py

data-processing data-cleaning decimal-conversion european-format locale-handling
function calculate_cv

Calculates the coefficient of variation (CV) for a dataset, expressed as a percentage of the standard deviation relative to the mean.

File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/d48d7789-9627-4e96-9f48-f90b687cd07d/analysis_1.py

statistics coefficient-of-variation data-analysis variability dispersion
function get_document_type_code

Retrieves a document type code from a dictionary lookup using the provided document type name, returning the name itself if no mapping exists.

File: /tf/active/vicechatdev/CDocs/settings_prod.py

lookup mapping document-type code-conversion dictionary-lookup
function extract_metadata_from_filecloud

Extracts and normalizes metadata from FileCloud for document creation, providing default values and generating document numbers when needed.

File: /tf/active/vicechatdev/CDocs/FC_sync.py

metadata normalization document-management filecloud data-extraction
function get_documents

Retrieves controlled documents from a Neo4j database with comprehensive filtering, permission-based access control, pagination, and full-text search capabilities.

File: /tf/active/vicechatdev/CDocs/controllers/document_controller.py

document-management neo4j graph-database filtering pagination
function normalize_uid

Normalizes a UID value by converting it to a string, or generates a new UUID v4 if the input is empty or None.

File: /tf/active/vicechatdev/CDocs/utils/uid_helper.py

uid uuid normalization database identifier
function _process_event_record

Processes a single event record from a database by normalizing timestamps, adding user information, and parsing JSON details fields.

File: /tf/active/vicechatdev/CDocs/utils/audit_trail.py

data-processing event-processing database timestamp-normalization json-parsing
function _origin_url

Removes the protocol scheme (http/https) from a URL string, returning only the domain and path portion.

File: /tf/active/vicechatdev/patches/server.py

url-processing string-manipulation protocol-removal url-normalization web-utilities
function process_ellipses

Expands an Ellipsis (...) in a __getitem__ key by replacing it with the appropriate number of empty slices (slice(None)) to match the dimensions of an object.

File: /tf/active/vicechatdev/patches/util.py

indexing ellipsis slicing multi-dimensional data-structures
class sanitize_identifier_fn

A parameterized function class that sanitizes strings (group/label values) to make them safe for use as Python attribute names in AttrTree structures by converting special characters to their unicode names and applying transformations.

File: /tf/active/vicechatdev/patches/util.py

sanitization identifier unicode string-processing attribute-names
function wrap_tuple

A utility function that ensures the input is wrapped in a tuple, leaving existing tuples unchanged and wrapping non-tuple values in a single-element tuple.

File: /tf/active/vicechatdev/patches/util.py

utility tuple type-conversion normalization wrapper
function get_path

Extracts and sanitizes a hierarchical path from a Labelled object or a tuple containing an existing path and a Labelled object, returning a tuple of capitalized, sanitized path components.

File: /tf/active/vicechatdev/patches/util.py

path-construction sanitization hierarchical-data labelled-objects string-processing
function parse_datetime_selection

Converts string or datetime-like selection specifications into parsed datetime objects, handling single values, slices, and collections.

File: /tf/active/vicechatdev/patches/util.py

datetime parsing data-selection time-series normalization
class PatentClient

A client class for searching and retrieving patent data from multiple sources including USPTO, EPO (European Patent Office), and The Lens API.

File: /tf/active/vicechatdev/QA_updater/data_access/patent_client.py

patent-search api-client uspto epo lens
class CompanyNewsClient

A client class for accessing company news and information from multiple sources including GDELT Project and NewsAPI, with built-in rate limiting and error handling.

File: /tf/active/vicechatdev/QA_updater/data_access/company_news_client.py

news-api gdelt news-aggregation company-news rate-limiting
class FormatNormalizer

Normalizes extracted data formats to ensure consistency. Handles: - Date format standardization - Number/currency normalization - VAT/tax number formatting - Field name standardization - Address formatting - Field value cleaning

File: /tf/active/vicechatdev/invoice_extraction/utils/format_normalizer.py

class formatnormalizer
function send_email

Sends templated emails to one or more recipients using either MS365 or SMTP provider based on configuration, with support for CC, BCC, and attachments.

File: /tf/active/vicechatdev/CDocs single class/utils/notifications.py

email notification messaging communication template
function fix_file_dates

Normalizes all timestamp attributes (creation, modification, access) of a file to the oldest timestamp among them, with optional dry-run mode for preview.

File: /tf/active/vicechatdev/mailsearch/fix_file_dates.py

file-metadata timestamp-normalization file-dates filesystem metadata-management
function fuzzy_match_filename

Calculates a fuzzy match similarity score between two filenames by comparing them after normalization, using exact matching, substring containment, and word overlap techniques.

File: /tf/active/vicechatdev/mailsearch/compare_documents.py

string-matching fuzzy-matching filename-comparison similarity-score text-processing

Search Examples

validation - Find validation functions
database - Find database-related components
email - Find email processing functions
api - Find API-related components
authentication - Find auth components

Search Components

Search Results for "normalization"

function clean_text

function extract_warranty_data_improved

function parse_email_address

function matches_source_filter

class SignatureManager

function hash_text

function clean_html_tags

function convert_european_decimals

function calculate_cv

function get_document_type_code

function extract_metadata_from_filecloud

function get_documents

function normalize_uid

function _process_event_record

function _origin_url

function process_ellipses

class sanitize_identifier_fn

function wrap_tuple

function get_path

function parse_datetime_selection

class PatentClient

class CompanyNewsClient

class FormatNormalizer

function send_email

function fix_file_dates

function fuzzy_match_filename

Search Examples