🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "hash"

Found 45 matching component(s)

  • class FixedProjectVictoriaGenerator

    Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    class fixedprojectvictoriagenerator
  • class ExtensiveSearchManager

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class extensivesearchmanager
  • class DocxMerger

    A class named DocxMerger

    File: /tf/active/vicechatdev/word_merge.py

    class docxmerger
  • class DocChatRAG

    Main RAG engine with three operating modes: 1. Basic RAG (similarity search) 2. Extensive (full document retrieval with preprocessing) 3. Full Reading (process all documents)

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    class docchatrag
  • class FileCloudClient_v1

    A client class for interacting with FileCloud storage systems through direct API calls, providing authentication, file search, download, and metadata retrieval capabilities.

    File: /tf/active/vicechatdev/contract_validity_analyzer/utils/filecloud_client.py

    filecloud storage api-client file-management document-management
  • function main_v51

    Entry point function that demonstrates document processing workflow by creating an audited, watermarked, and protected PDF/A document from a DOCX file with audit trail data.

    File: /tf/active/vicechatdev/document_auditor/main.py

    document-processing pdf-generation audit-trail watermarking pdf-a-compliance
  • class DocumentProcessor

    A comprehensive document processing class that converts documents to PDF, adds audit trails, applies security features (watermarks, signatures, hashing), and optionally converts to PDF/A format with document protection.

    File: /tf/active/vicechatdev/document_auditor/src/document_processor.py

    document-processing pdf-generation audit-trail security watermarking
  • class HashGenerator

    A class that provides cryptographic hashing functionality for PDF documents, including hash generation, embedding, and verification for document integrity checking.

    File: /tf/active/vicechatdev/document_auditor/src/security/hash_generator.py

    cryptography hashing SHA-256 PDF document-integrity
  • function clean_collection

    Cleans a ChromaDB collection by removing duplicate and similar documents using hash-based and similarity-based deduplication techniques, then saves the cleaned data to a new collection.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    data-cleaning deduplication chromadb vector-database similarity-detection
  • function main_v52

    Command-line interface function that orchestrates a ChromaDB collection cleaning pipeline by removing duplicate and similar documents through hashing and similarity screening.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    cli command-line data-cleaning deduplication chromadb
  • function hash_cleaner

    A pytest fixture that instantiates and returns a HashCleaner object for use in test cases.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_hash_cleaner.py

    pytest fixture testing hash cleaner
  • function test_remove_identical_chunks

    A pytest test function that verifies the HashCleaner's ability to remove duplicate text chunks from a list while preserving order and unique entries.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_hash_cleaner.py

    testing pytest unit-test deduplication text-processing
  • function test_empty_input_v1

    A pytest test function that verifies the HashCleaner's behavior when processing an empty list of text chunks.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_hash_cleaner.py

    testing unit-test pytest edge-case boundary-condition
  • function test_no_identical_chunks

    A unit test function that verifies the HashCleaner's behavior when processing a list of unique text chunks, ensuring no chunks are removed when all are distinct.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_hash_cleaner.py

    unit-test pytest hash-cleaner deduplication text-processing
  • function test_identical_chunks_with_different_cases

    A unit test function that verifies the HashCleaner's ability to remove duplicate text chunks while being case-sensitive, ensuring that strings differing only in case are treated as distinct entries.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_hash_cleaner.py

    unit-test pytest deduplication case-sensitive text-processing
  • class Config_v6

    A dataclass that stores configuration settings for a ChromaDB cleanup process, including connection parameters, cleaning/clustering options, and summarization settings.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/config.py

    configuration dataclass chromadb settings cleanup
  • function hash_text

    Creates a SHA-256 hash of normalized text content to generate a unique identifier for documents, enabling duplicate detection and content comparison.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    hashing text-processing deduplication content-fingerprinting sha256
  • function identify_duplicates

    Identifies duplicate documents by computing hash values of their text content and grouping documents with identical hashes.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    deduplication document-processing hashing data-cleaning duplicate-detection
  • function get_unique_documents

    Identifies and separates unique documents from duplicates in a list by comparing hash values of document text content.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    deduplication document-processing data-cleaning hashing text-processing
  • class HashCleaner

    A document deduplication cleaner that removes documents with identical content by comparing hash values of document text.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/hash_cleaner.py

    deduplication data-cleaning hash-based document-processing duplicate-removal
  • class CombinedCleaner

    A document cleaner that combines hash-based and similarity-based cleaning approaches to remove both exact and near-duplicate documents in a two-stage process.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/combined_cleaner.py

    document-cleaning deduplication data-processing hash-based similarity-based
  • class ExtensiveSearchManager_v1

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    class extensivesearchmanager
  • class DocumentVersion_v1

    Model representing a specific version of a controlled document.

    File: /tf/active/vicechatdev/CDocs/models/document.py

    class documentversion
  • class DocUser

    Extended User model for document management system. This class wraps and extends the existing User model in the system, adding document-specific roles, departments, and permissions.

    File: /tf/active/vicechatdev/CDocs/models/user_extensions.py

    class docuser
  • function create_user

    Creates a new user in the system with validation, password hashing, and role assignment, including CDocs user extension setup.

    File: /tf/active/vicechatdev/CDocs/controllers/admin_controller.py

    user-management authentication database validation password-hashing
  • class PDFGenerator

    PDF document generation for reports and controlled documents This class provides methods to generate PDF documents from scratch, including audit reports, document covers, and certificate pages.

    File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

    class pdfgenerator
  • class ControlledDocumentConverter

    A comprehensive document converter class that transforms controlled documents into archived PDFs with signature pages, audit trails, hash-based integrity verification, and PDF/A compliance for long-term archival.

    File: /tf/active/vicechatdev/CDocs/utils/document_converter.py

    document-conversion pdf-generation audit-trail document-control archival
  • function process_document

    Processes a document file (DOCX, DOC, or PDF) and extracts comprehensive metadata including file information, content metadata, and cryptographic hash.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    document-processing metadata-extraction file-handling pdf docx
  • function extract_metadata

    Extracts metadata from file content by analyzing the file type and computing file properties including hash, size, and type-specific metadata.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    metadata-extraction file-processing document-analysis hash-computation file-type-detection
  • function compare_document_versions

    Compares two document versions by their UIDs and generates a summary of changes including metadata differences and hash comparisons.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    document-management version-control comparison diff metadata
  • function check_document_hash_exists

    Checks if a document with a given SHA-256 hash already exists in the database by querying the graph database for matching DocumentVersion nodes.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    database graph-database neo4j document-management deduplication
  • class AdminPanel

    Admin configuration interface component

    File: /tf/active/vicechatdev/CDocs/ui/admin_panel.py

    class adminpanel
  • class DynamicSchemaDiscovery

    Discovers database schema from live database connection

    File: /tf/active/vicechatdev/full_smartstat/dynamic_schema_discovery.py

    class dynamicschemadiscovery
  • function download_generated_file

    Flask route handler that downloads generated files from a user's session directory, with security checks and support for nested analysis subdirectories.

    File: /tf/active/vicechatdev/full_smartstat/app.py

    flask file-download web-endpoint security path-traversal-protection
  • function serve_generated_file

    Flask route handler that serves generated files (images, HTML, CSS, JS, etc.) from session-specific directories, with security checks and automatic MIME type detection.

    File: /tf/active/vicechatdev/full_smartstat/app.py

    flask file-serving web-route security mime-type
  • class StatisticalAnalysisService

    Main service for statistical analysis orchestration

    File: /tf/active/vicechatdev/full_smartstat/services.py

    class statisticalanalysisservice
  • class StatisticalAnalysisService_v1

    Main service for statistical analysis orchestration

    File: /tf/active/vicechatdev/smartstat/services.py

    class statisticalanalysisservice
  • class HashableJSON

    A JSON encoder extension that generates hashable string representations for a wide variety of Python objects, including those not normally JSON-serializable like sets, numpy arrays, and pandas DataFrames.

    File: /tf/active/vicechatdev/patches/util.py

    json hashing serialization memoization caching
  • function deephash

    Computes a hash value for any Python object by serializing it to JSON using a custom HashableJSON encoder and returning the hash of the resulting string.

    File: /tf/active/vicechatdev/patches/util.py

    hashing serialization json object-comparison caching
  • class Callable

    Callable is a wrapper class for callback functions used with DynamicMaps, providing memoization, stream management, and input/output tracking capabilities.

    File: /tf/active/vicechatdev/patches/spaces.py

    callback wrapper memoization caching dynamic-visualization
  • class DynamicMap

    A DynamicMap is a type of HoloMap where the elements are dynamically generated by a callable. The callable is invoked with values associated with the key dimensions or with values supplied by stream parameters.

    File: /tf/active/vicechatdev/patches/spaces.py

    class dynamicmap
  • class DocumentDownloader

    A client class for downloading documents (primarily PDFs) from various sources, managing download caching, respecting rate limits per domain, and processing documents using llmsherpa for content extraction.

    File: /tf/active/vicechatdev/QA_updater/data_access/document_downloader.py

    document-download pdf-processing rate-limiting caching llmsherpa
  • class ExtensiveSearchManager_v2

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/data_capture_backup_18072025/OneCo_hybrid_RAG.py

    class extensivesearchmanager
  • class DocumentVersion

    Model class representing a specific version of a controlled document in a document management system, handling version metadata, file paths, status tracking, and review workflows.

    File: /tf/active/vicechatdev/CDocs single class/models/document.py

    document-management version-control file-storage workflow review-cycle
  • class DocUser_v1

    Extended User model for document management system. This class wraps and extends the existing User model in the system, adding document-specific roles, departments, and permissions.

    File: /tf/active/vicechatdev/CDocs single class/models/user_extensions.py

    class docuser

Search Examples