🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "chunking"

Found 12 matching component(s)

  • class QueryBasedExtractor_v2

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    information-extraction text-processing llm openai query-based
  • class ExtensiveSearchManager

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class extensivesearchmanager
  • class QueryBasedExtractor

    A class that extracts relevant information from documents using a small LLM (Language Model), designed for Extensive and Full Reading modes in RAG systems.

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    information-extraction document-processing llm rag query-based
  • class DocumentProcessor_v7

    Process different document types for indexing

    File: /tf/active/vicechatdev/docchat/document_indexer.py

    class documentprocessor
  • class DocumentIndexer

    A class for indexing documents into ChromaDB with support for multiple file formats (PDF, Word, PowerPoint, Excel, text files), smart incremental indexing, and document chunk management.

    File: /tf/active/vicechatdev/docchat/document_indexer.py

    document-indexing vector-database chromadb embeddings pdf-processing
  • class AgentExecutor

    Agent-based script executor that generates standalone Python files, manages dependencies, and provides iterative debugging capabilities

    File: /tf/active/vicechatdev/vice_ai/agent_executor.py

    class agentexecutor
  • class QueryBasedExtractor_v1

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    information-extraction llm openai text-processing query-based
  • class ExtensiveSearchManager_v1

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    class extensivesearchmanager
  • function test_enhanced_pdf_processing

    A comprehensive test function that validates PDF processing capabilities, including text extraction, cleaning, chunking, and table detection across multiple PDF processing libraries.

    File: /tf/active/vicechatdev/vice_ai/test_enhanced_pdf.py

    testing pdf-processing document-processing diagnostic text-extraction
  • function create_test_file

    Creates a temporary text file with predefined multi-chapter test content for testing document extraction and processing functionality.

    File: /tf/active/vicechatdev/vice_ai/test_extraction_debug.py

    testing file-creation temporary-file test-data document-processing
  • class VersionComparisonService

    A service class that compares two versions of a document using LLM-based analysis, implementing smart segmentation and chunking for handling large documents efficiently.

    File: /tf/active/vicechatdev/CDocs/utils/version_comparison.py

    document-comparison version-control llm openai text-analysis
  • class AgentExecutor_v1

    Agent-based script executor that generates standalone Python files, manages dependencies, and provides iterative debugging capabilities

    File: /tf/active/vicechatdev/full_smartstat/agent_executor.py

    class agentexecutor

Search Examples