🔍 Code Extractor

Browse Components

Showing 20 of 1740 components

  • function check_debug_endpoint

    Queries a debug endpoint to retrieve and display cache information from a web server, including debug mode status, cache buster values, and static file versions.

    File: /tf/active/vicechatdev/vice_ai/dev_tools.py | Lines: 59-82

    debugging cache monitoring http web-server
  • function touch_static_files

    Updates the modification timestamp of CSS and JavaScript files in a static directory to force browser cache refresh.

    File: /tf/active/vicechatdev/vice_ai/dev_tools.py | Lines: 38-57

    cache-busting static-files file-system deployment development-tools
  • function check_static_files

    Scans a 'static' directory for CSS and JavaScript files and prints their modification times relative to the current time.

    File: /tf/active/vicechatdev/vice_ai/dev_tools.py | Lines: 22-36

    file-system diagnostics static-files monitoring development-tools
  • function clear_browser_cache_instructions

    A utility function that prints formatted instructions to the console for clearing browser cache across different web browsers and operating systems.

    File: /tf/active/vicechatdev/vice_ai/dev_tools.py | Lines: 13-20

    browser cache instructions user-help web-development
  • function test_tab_delimited_european

    A unit test function that validates the smart_read_csv function's ability to correctly parse tab-delimited CSV files containing European-style decimal numbers (using commas instead of periods).

    File: /tf/active/vicechatdev/vice_ai/test_regional_formats.py | Lines: 152-183

    testing unit-test csv-parsing european-decimals tab-delimited
  • function test_us_with_thousands

    A unit test function that validates the smart_read_csv function's ability to correctly parse US-formatted CSV files containing numbers with thousand separators (commas) and decimal points.

    File: /tf/active/vicechatdev/vice_ai/test_regional_formats.py | Lines: 118-149

    testing unit-test csv-parsing data-validation number-formatting
  • function test_european_with_thousands

    A unit test function that validates the smart_read_csv function's ability to correctly parse European-formatted CSV files with thousand separators (dots) and decimal commas.

    File: /tf/active/vicechatdev/vice_ai/test_regional_formats.py | Lines: 84-115

    testing unit-test csv-parsing european-format number-formatting
  • function test_us_csv

    A unit test function that validates the smart_read_csv function's ability to correctly parse US-formatted CSV files with comma delimiters and point decimal separators.

    File: /tf/active/vicechatdev/vice_ai/test_regional_formats.py | Lines: 48-81

    testing unit-test csv data-parsing us-format
  • function test_european_csv

    A test function that validates the ability to read and parse European-formatted CSV files (semicolon delimiters, comma decimal separators) and convert them to proper numeric types.

    File: /tf/active/vicechatdev/vice_ai/test_regional_formats.py | Lines: 12-45

    testing csv european-format data-parsing unit-test
  • function raw_cleanup_database

    Performs raw database cleanup on a SQLite database to identify and fix corrupted chat_session_id values in the text_sections table by converting invalid string representations ('{}', '[]', 'null', '') to NULL.

    File: /tf/active/vicechatdev/vice_ai/raw_database_cleanup.py | Lines: 8-90

    database cleanup maintenance sqlite data-integrity
  • class BaseCleaner

    Abstract base class that defines the interface for document cleaning implementations, providing methods to remove redundancy from document collections and track cleaning statistics.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/base_cleaner.py | Lines: 6-46

    abstract-base-class document-processing data-cleaning redundancy-removal statistics
  • class SimilarityCleaner

    A document cleaning class that identifies and removes duplicate or highly similar documents based on embedding vector similarity, keeping only representative documents from each similarity group.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/similarity_cleaner.py | Lines: 8-79

    document-processing deduplication similarity embeddings clustering
  • class CombinedCleaner

    A document cleaner that combines hash-based and similarity-based cleaning approaches to remove both exact and near-duplicate documents in a two-stage process.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/combined_cleaner.py | Lines: 8-45

    document-cleaning deduplication data-processing hash-based similarity-based
  • class HashCleaner

    A document deduplication cleaner that removes documents with identical content by comparing hash values of document text.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/hash_cleaner.py | Lines: 7-36

    deduplication data-cleaning hash-based document-processing duplicate-removal
  • class SummarizationConfig

    A configuration wrapper class that manages settings for a text summarization model by encapsulating a SummarizationModel instance.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/summarization/models.py | Lines: 8-17

    configuration summarization model-config wrapper nlp
  • class SummarizationModel

    A Pydantic data model class that defines the configuration schema for a text summarization model, including model name, token limits, and temperature settings.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/summarization/models.py | Lines: 3-6

    pydantic data-model configuration validation summarization
  • function create_summary

    Creates a text summary using OpenAI's GPT models or returns a truncated version as fallback when API key is unavailable.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/summarization/summarizer.py | Lines: 38-79

    summarization text-processing openai gpt nlp
  • function summarize_text

    A deprecated standalone function that was originally designed to summarize groups of similar documents but now only returns the input documents unchanged with a deprecation warning.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/summarization/summarizer.py | Lines: 22-35

    deprecated text-summarization document-processing nlp text-clustering
  • function init_openai_client

    Initializes the OpenAI client by setting the API key from either a provided parameter or environment variable.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/summarization/summarizer.py | Lines: 7-19

    initialization authentication openai api-key configuration
  • function get_unique_documents

    Identifies and separates unique documents from duplicates in a list by comparing hash values of document text content.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py | Lines: 44-67

    deduplication document-processing data-cleaning hashing text-processing