🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "collection"

Found 50 matching component(s)

  • class OneCo_hybrid_RAG

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    class oneco_hybrid_rag
  • class FixedProjectVictoriaGenerator

    Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    class fixedprojectvictoriagenerator
  • class pathobrowser_base

    Base class that contains all static elements of the app.

    Parameters
    ----------
    image : str
        An Image UID which may be passed on app startup. Immediately redirects to said image.

    Attributes
    ----------
    current_user : Userclass
        A class containing various information on the user
    workspace : panel.layout.Column
        The main container of the app
    sidebar : panel.layout.Column
        Container showing items on the side of the app
    head : panel.layout.Row
        The header of the app
    modal : panel.layout.Column
        The container for the modal window of the app

    File: /tf/active/vicechatdev/datacapture_integrated.py

    class pathobrowser_base
  • class OneCo_hybrid_RAG_v1

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG_old.py

    class oneco_hybrid_rag
  • class DocumentProcessor_v5

    Process different document types for RAG context extraction

    File: /tf/active/vicechatdev/offline_docstore_multi_vice.py

    class documentprocessor
  • class options

    A Panel-based UI class for managing slide release visibility in a study management system, allowing users to view and toggle the release status of slides at various hierarchical levels (Study, Group, Animal, Block, Slide).

    File: /tf/active/vicechatdev/options.py

    panel ui neo4j graph-database slide-management
  • class ImprovedProjectVictoriaGenerator

    Improved Project Victoria Disclosure Generator with proper reference management.

    File: /tf/active/vicechatdev/improved_project_victoria_generator.py

    class improvedprojectvictoriagenerator
  • class OneCo_hybrid_RAG_v2

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class oneco_hybrid_rag
  • class ExtensiveSearchManager

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class extensivesearchmanager
  • function test_chroma_collections

    A diagnostic function that tests connectivity to ChromaDB instances across multiple connection methods and lists all available collections with their metadata.

    File: /tf/active/vicechatdev/test_chroma_collections.py

    chromadb database-testing diagnostics connection-testing vector-database
  • function test_collection_creation

    A diagnostic test function that verifies Chroma DB functionality by creating a test collection, adding a document, querying it, and cleaning up.

    File: /tf/active/vicechatdev/test_chroma_collections.py

    testing debugging chroma-db vector-database health-check
  • function main_v47

    Entry point function that executes a comprehensive test suite for Chroma DB collections, including collection listing and creation tests, followed by troubleshooting suggestions.

    File: /tf/active/vicechatdev/test_chroma_collections.py

    testing debugging chroma-db database collections
  • class ProjectVictoriaDisclosureGenerator

    Main class for generating Project Victoria disclosures from warranty claims.

    File: /tf/active/vicechatdev/project_victoria_disclosure_generator.py

    class projectvictoriadisclosuregenerator
  • class DocumentProcessor_v6

    Process different document types for RAG context extraction

    File: /tf/active/vicechatdev/offline_docstore_multi.py

    class documentprocessor
  • class pathobrowser_base_v1

    Base class that contains all static elements of the app.

    Parameters
    ----------
    image : str
        An Image UID which may be passed on app startup. Immediately redirects to said image.

    Attributes
    ----------
    current_user : Userclass
        A class containing various information on the user
    workspace : panel.layout.Column
        The main container of the app
    sidebar : panel.layout.Column
        Container showing items on the side of the app
    head : panel.layout.Row
        The header of the app
    modal : panel.layout.Column
        The container for the modal window of the app

    File: /tf/active/vicechatdev/datacapture.py

    class pathobrowser_base
  • function allowed_file

    Validates whether a filename has an allowed file extension by checking if it contains a dot and if the extension (case-insensitive) exists in a predefined ALLOWED_EXTENSIONS collection.

    File: /tf/active/vicechatdev/leexi/app.py

    validation file-upload security flask file-extension
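
The extension check described for allowed_file can be sketched as follows. This is a minimal illustration, not the code in leexi/app.py: the contents of ALLOWED_EXTENSIONS are a hypothetical placeholder, since the listing does not show the real values.

```python
# Hypothetical whitelist; the real ALLOWED_EXTENSIONS is not shown in the listing.
ALLOWED_EXTENSIONS = {"pdf", "docx", "txt"}


def allowed_file(filename: str) -> bool:
    """Return True if the filename contains a dot and its extension is whitelisted."""
    if "." not in filename:
        return False
    # rsplit keeps only the text after the last dot, so "a.tar.GZ" yields "GZ".
    extension = filename.rsplit(".", 1)[1].lower()
    return extension in ALLOWED_EXTENSIONS
```

Lower-casing the extension before the lookup is what makes the check case-insensitive, so "report.PDF" and "report.pdf" are treated alike.
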
  • function main_v50

    Interactive CLI helper function that generates and displays instructions for granting SharePoint app permissions to an Azure AD application.

    File: /tf/active/vicechatdev/SPFCsync/grant_sharepoint_access.py

    sharepoint azure-ad permissions cli-helper configuration
  • function reset_collection

    Deletes an existing ChromaDB collection and logs the operation, requiring an application restart to recreate the collection.

    File: /tf/active/vicechatdev/docchat/reset_collection.py

    chromadb vector-database database-management collection-reset cleanup
  • function index_documents_example

    A demonstration function that indexes documents from a specified folder using a DocumentIndexer, creating the folder if it doesn't exist, and displays indexing results and collection statistics.

    File: /tf/active/vicechatdev/docchat/example_usage.py

    document-indexing example tutorial demonstration RAG
  • function cleanup_old_tasks

    Removes tasks from the active_tasks dictionary that are older than 1 hour (3600 seconds) based on their creation timestamp.

    File: /tf/active/vicechatdev/docchat/app.py

    cleanup maintenance task-management thread-safe memory-management
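
A rough sketch of the age-based cleanup described for cleanup_old_tasks is shown below. The shape of active_tasks (a dict mapping task IDs to records with a "created_at" epoch timestamp), the lock name, and the optional now parameter are all assumptions made for illustration; only the one-hour cutoff comes from the description.

```python
import threading
import time
from typing import Dict, Optional

MAX_TASK_AGE_SECONDS = 3600  # one hour, as stated in the description

# Assumed layout: task_id -> {"created_at": <epoch seconds>, ...}
active_tasks: Dict[str, dict] = {}
active_tasks_lock = threading.Lock()  # hypothetical lock guarding active_tasks


def cleanup_old_tasks(now: Optional[float] = None) -> int:
    """Remove tasks older than MAX_TASK_AGE_SECONDS and return how many were removed."""
    current = time.time() if now is None else now
    with active_tasks_lock:
        # Collect the expired IDs first so the dict is not mutated while iterating.
        expired = [
            task_id
            for task_id, task in active_tasks.items()
            if current - task["created_at"] > MAX_TASK_AGE_SECONDS
        ]
        for task_id in expired:
            del active_tasks[task_id]
    return len(expired)
```

Accepting now as a parameter is only a convenience for testing; a caller would normally omit it and let the function read the clock.
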
  • function get_document_info

    Retrieves indexing status and metadata for a document, including whether it's indexed, its document ID, chunk count, and reindexing status.

    File: /tf/active/vicechatdev/docchat/app.py

    document-management indexing metadata vector-database chromadb
  • function index_v1

    Flask route handler that renders the main application page with user session management, authentication checks, and document collection statistics.

    File: /tf/active/vicechatdev/docchat/app.py

    flask web-route session-management authentication main-page
  • function api_update_system_config

    Flask API endpoint that allows administrators to update system configuration settings including system role, expertise, domain context, custom instructions, output style, and query languages, with persistence to disk.

    File: /tf/active/vicechatdev/docchat/app.py

    admin configuration system-settings api-endpoint flask
  • function api_documents

    Flask API endpoint that retrieves statistics and metadata about indexed documents from a document indexer service.

    File: /tf/active/vicechatdev/docchat/app.py

    flask api rest-endpoint document-management statistics
  • function check_configuration

    A comprehensive configuration verification function that checks and displays the status of all DocChat system settings, including API keys, models, ChromaDB connection, directories, and LLM initialization.

    File: /tf/active/vicechatdev/docchat/verify_setup.py

    configuration verification diagnostics setup validation
  • class DocChatRAG

    Main RAG engine with three operating modes:
      1. Basic RAG (similarity search)
      2. Extensive (full document retrieval with preprocessing)
      3. Full Reading (process all documents)

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    class docchatrag
  • function index_v2

    Flask route handler that renders the main DocChat interface with document collection statistics.

    File: /tf/active/vicechatdev/docchat/blueprint.py

    flask route-handler web-interface authentication rag
  • function get_stats

    Flask API endpoint that retrieves and returns statistics about a document collection from a RAG (Retrieval-Augmented Generation) system.

    File: /tf/active/vicechatdev/docchat/blueprint.py

    flask api endpoint statistics rag
  • class DocumentIndexer

    A class for indexing documents into ChromaDB with support for multiple file formats (PDF, Word, PowerPoint, Excel, text files), smart incremental indexing, and document chunk management.

    File: /tf/active/vicechatdev/docchat/document_indexer.py

    document-indexing vector-database chromadb embeddings pdf-processing
  • function test_incremental_indexing

    Comprehensive test function that validates incremental indexing functionality of a document indexing system, including initial indexing, change detection, re-indexing, and force re-indexing scenarios.

    File: /tf/active/vicechatdev/docchat/test_incremental_indexing.py

    testing incremental-indexing document-indexing integration-test file-system
  • function configure_docchat

    Configures DocChat module settings by overriding default configuration values from a Flask application instance and optional keyword arguments.

    File: /tf/active/vicechatdev/docchat/integration.py

    configuration flask setup chromadb openai
  • function main_v105

    Command-line interface function that orchestrates the cleaning of ChromaDB collections by removing duplicates and similar documents, with options to skip collections and customize the cleaning process.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    cli command-line chromadb database-cleaning deduplication
  • function clean_collection

    Cleans a ChromaDB collection by removing duplicate and similar documents using hash-based and similarity-based deduplication techniques, then saves the cleaned data to a new collection.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    data-cleaning deduplication chromadb vector-database similarity-detection
  • function load_data_from_chromadb

    Connects to a ChromaDB instance and retrieves all documents from a specified collection, returning them as a list of dictionaries with document IDs, text content, embeddings, and metadata.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    chromadb vector-database data-loading document-retrieval embeddings
  • function save_data_to_chromadb_v1

    Saves a list of document dictionaries to a ChromaDB collection, with support for batch processing, embeddings, and metadata storage.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    chromadb vector-database document-storage embeddings batch-processing
  • function main_v89

    Command-line interface function that orchestrates a ChromaDB collection cleaning pipeline by removing duplicate and similar documents through hashing and similarity screening.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    cli command-line data-cleaning deduplication chromadb
  • function load_data_from_chromadb_v1

    Retrieves all documents from a specified ChromaDB collection, including their IDs, text content, embeddings, and metadata.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    chromadb database document-retrieval vector-database embeddings
  • function save_data_to_chromadb

    Saves a list of document dictionaries to a ChromaDB vector database collection, optionally including embeddings and metadata.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    chromadb vector-database document-storage embeddings persistence
  • function test_identical_text_removal

    A pytest test function that verifies the SimilarityCleaner's ability to remove identical duplicate text entries from a list while preserving unique documents.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_similarity_cleaner.py

    testing pytest unit-test deduplication text-processing
  • class Config_v6

    A dataclass that stores configuration settings for a ChromaDB cleanup process, including connection parameters, cleaning/clustering options, and summarization settings.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/config.py

    configuration dataclass chromadb settings cleanup
  • function build_similarity_matrix

    Computes a pairwise cosine similarity matrix for a collection of embedding vectors, where each cell (i,j) represents the similarity between embedding i and embedding j.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/similarity_utils.py

    embeddings similarity cosine-similarity matrix nlp
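
The pairwise matrix described for build_similarity_matrix could look roughly like this in pure Python. The actual function in similarity_utils.py may well use a numerical library instead; treating zero-length vectors as having similarity 0.0 is an assumption of this sketch.

```python
import math
from typing import List, Sequence


def build_similarity_matrix(embeddings: Sequence[Sequence[float]]) -> List[List[float]]:
    """Return an n x n matrix where cell (i, j) is the cosine similarity of rows i and j."""
    norms = [math.sqrt(sum(x * x for x in vec)) for vec in embeddings]
    n = len(embeddings)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        # Cosine similarity is symmetric, so only the upper triangle is computed.
        for j in range(i, n):
            if norms[i] == 0.0 or norms[j] == 0.0:
                similarity = 0.0  # assumed convention for zero vectors
            else:
                dot = sum(a * b for a, b in zip(embeddings[i], embeddings[j]))
                similarity = dot / (norms[i] * norms[j])
            matrix[i][j] = matrix[j][i] = similarity
    return matrix
```
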
  • function find_similar_documents

    Identifies pairs of similar documents by comparing their embeddings and returns those exceeding a specified similarity threshold, sorted by similarity score.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/similarity_utils.py

    document-similarity embedding-comparison duplicate-detection cosine-similarity nlp
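
One way to picture the threshold search described for find_similar_documents is the sketch below. The return shape (index pairs with a score), the default threshold, the use of >= at the cutoff, and the descending sort order are assumptions; the listing only says that pairs exceeding a threshold are returned sorted by similarity.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length (assumption)."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def find_similar_documents(
    embeddings: Sequence[Sequence[float]], threshold: float = 0.9
) -> List[Tuple[int, int, float]]:
    """Return (i, j, score) for each pair i < j whose similarity meets the threshold."""
    pairs = []
    for i in range(len(embeddings)):
        for j in range(i + 1, len(embeddings)):
            score = cosine_similarity(embeddings[i], embeddings[j])
            if score >= threshold:
                pairs.append((i, j, score))
    # Most similar pairs first (assumed ordering).
    pairs.sort(key=lambda pair: pair[2], reverse=True)
    return pairs
```

Comparing every pair is quadratic in the number of documents, which is fine for a sketch but may need batching or an approximate index on large collections.
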
  • function identify_duplicates

    Identifies duplicate documents by computing hash values of their text content and grouping documents with identical hashes.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/utils/hash_utils.py

    deduplication document-processing hashing data-cleaning duplicate-detection
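
The hash-and-group approach described for identify_duplicates might be sketched like this. The "id" and "text" dictionary keys and the choice of SHA-256 are assumptions; the listing does not say which hash function or field names hash_utils.py uses.

```python
import hashlib
from collections import defaultdict
from typing import Dict, List


def identify_duplicates(documents: List[dict]) -> Dict[str, List[str]]:
    """Group document IDs by the hash of their text; keep only groups with duplicates."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for doc in documents:
        # Assumed keys: "text" holds the content, "id" the document identifier.
        digest = hashlib.sha256(doc["text"].encode("utf-8")).hexdigest()
        groups[digest].append(doc["id"])
    # A hash seen only once means the document has no exact duplicate.
    return {digest: ids for digest, ids in groups.items() if len(ids) > 1}
```

Because the hash covers the raw text, two documents that differ only in whitespace or casing would not be grouped; normalizing the text before hashing would be a separate design decision.
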
  • class HashCleaner

    A document deduplication cleaner that removes documents with identical content by comparing hash values of document text.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/hash_cleaner.py

    deduplication data-cleaning hash-based document-processing duplicate-removal
  • class AgentExecutor

    Agent-based script executor that generates standalone Python files, manages dependencies, and provides iterative debugging capabilities

    File: /tf/active/vicechatdev/vice_ai/agent_executor.py

    class agentexecutor
  • function remove_uploaded_document_v1

    Removes a specific uploaded document from a user's document collection in the application state, with thread-safe locking and automatic cleanup of empty user entries.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    document-management thread-safe state-management deletion cleanup
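
The locked removal with empty-entry cleanup described for remove_uploaded_document_v1 can be illustrated as below. The nested-dict layout of the state, the lock name, and the boolean return value are hypothetical; the listing does not show how complex_app.py stores uploaded documents.

```python
import threading
from typing import Dict

# Assumed layout: user_id -> {document_id -> document record}
uploaded_documents: Dict[str, Dict[str, dict]] = {}
uploaded_documents_lock = threading.Lock()  # hypothetical lock guarding the state


def remove_uploaded_document(user_id: str, document_id: str) -> bool:
    """Remove one document for a user; return True if something was removed."""
    with uploaded_documents_lock:
        user_docs = uploaded_documents.get(user_id)
        if not user_docs or document_id not in user_docs:
            return False
        del user_docs[document_id]
        # Drop the user's entry entirely once their last document is gone.
        if not user_docs:
            del uploaded_documents[user_id]
        return True
```
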
  • function api_send_chat_message

    Flask API endpoint that handles sending a message in a chat session, processes it through a hybrid RAG engine with configurable search and memory settings, and returns an AI-generated response with references.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    flask api chat rag hybrid-rag
  • function cleanup_old_documents

    Periodically removes documents and their associated files that are older than 2 hours from the uploaded_documents dictionary, cleaning up both file system storage and memory.

    File: /tf/active/vicechatdev/vice_ai/app.py

    cleanup maintenance file-management document-management scheduled-task
  • function process_chat_request_background

    Process chat request in background thread

    File: /tf/active/vicechatdev/vice_ai/app.py

    function process_chat_request_background
  • function api_chat_v1

    Handle chat API requests with support for long-running tasks

    File: /tf/active/vicechatdev/vice_ai/app.py

    function api_chat
