🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "hashing"

Found 14 matching component(s)

  • class DocumentProcessor

    A comprehensive document processing class that converts documents to PDF, adds audit trails, applies security features (watermarks, signatures, hashing), and optionally converts to PDF/A format with document protection.

    File: /tf/active/vicechatdev/document_auditor/src/document_processor.py

    document-processing pdf-generation audit-trail security watermarking
  • class HashGenerator

    A class that provides cryptographic hashing functionality for PDF documents, including hash generation, embedding, and verification for document integrity checking.

    File: /tf/active/vicechatdev/document_auditor/src/security/hash_generator.py

    cryptography hashing SHA-256 PDF document-integrity
  • function clean_collection

    Cleans a ChromaDB collection by removing duplicate and similar documents using hash-based and similarity-based deduplication techniques, then saves the cleaned data to a new collection.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    data-cleaning deduplication chromadb vector-database similarity-detection
  • function main_v51

    Command-line interface function that orchestrates a ChromaDB collection cleaning pipeline by removing duplicate and similar documents through hashing and similarity screening.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    cli command-line data-cleaning deduplication chromadb
  • function hash_text

    Creates a SHA-256 hash of normalized text content to generate a unique identifier for documents, enabling duplicate detection and content comparison.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    hashing text-processing deduplication content-fingerprinting sha256
  • function identify_duplicates

    Identifies duplicate documents by computing hash values of their text content and grouping documents with identical hashes.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    deduplication document-processing hashing data-cleaning duplicate-detection
  • function get_unique_documents

    Identifies and separates unique documents from duplicates in a list by comparing hash values of document text content.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    deduplication document-processing data-cleaning hashing text-processing
  • class DocUser

    Extended User model for document management system. This class wraps and extends the existing User model in the system, adding document-specific roles, departments, and permissions.

    File: /tf/active/vicechatdev/CDocs/models/user_extensions.py

    class docuser
  • function create_user

    Creates a new user in the system with validation, password hashing, and role assignment, including CDocs user extension setup.

    File: /tf/active/vicechatdev/CDocs/controllers/admin_controller.py

    user-management authentication database validation password-hashing
  • function update_user

    Updates an existing user's information in a Neo4j database, including profile fields, password, and role assignments.

    File: /tf/active/vicechatdev/CDocs/controllers/admin_controller.py

    user-management database-update neo4j authentication role-management
  • function process_document

    Processes a document file (DOCX, DOC, or PDF) and extracts comprehensive metadata including file information, content metadata, and cryptographic hash.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    document-processing metadata-extraction file-handling pdf docx
  • function _int_to_bytes

    Converts a signed integer to its little-endian byte representation, automatically determining the minimum number of bytes needed based on the integer's bit length.

    File: /tf/active/vicechatdev/patches/util.py

    serialization bytes integer-conversion little-endian binary
  • class HashableJSON

    A JSON encoder extension that generates hashable string representations for a wide variety of Python objects, including those not normally JSON-serializable like sets, numpy arrays, and pandas DataFrames.

    File: /tf/active/vicechatdev/patches/util.py

    json hashing serialization memoization caching
  • function deephash

    Computes a hash value for any Python object by serializing it to JSON using a custom HashableJSON encoder and returning the hash of the resulting string.

    File: /tf/active/vicechatdev/patches/util.py

    hashing serialization json object-comparison caching

Search Examples