🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)
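
As a rough illustration of the full-text mode described above, the Python sketch below does case-insensitive keyword matching over component names, descriptions, and tags. The `Component` record and the `full_text_search` function are hypothetical names introduced here; the tool's real index and ranking are not shown on this page.

```python
from dataclasses import dataclass, field


@dataclass
class Component:
    # Hypothetical record shape, modelled on the fields shown for each result.
    name: str
    description: str
    tags: list[str] = field(default_factory=list)


def full_text_search(components, query):
    """Return components whose name, description, or tags contain the query."""
    needle = query.lower()
    hits = []
    for comp in components:
        # Join all searchable fields into one lowercase string.
        haystack = " ".join([comp.name, comp.description, *comp.tags]).lower()
        if needle in haystack:
            hits.append(comp)
    return hits


# Two entries abbreviated from the results on this page.
catalog = [
    Component(
        "merge_pdfs",
        "Merges multiple PDF files into a single consolidated PDF document.",
        ["pdf", "merge", "combine"],
    ),
    Component(
        "max_range",
        "Computes the maximal lower and upper bounds from a list of range tuples.",
        ["bounds", "aggregation"],
    ),
]

print([c.name for c in full_text_search(catalog, "combine")])  # ['merge_pdfs']
```

A semantic mode would typically compare embedding vectors rather than substrings, so it can return entries that never contain the query word itself.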

Search Results for "combine"

Found 50 matching component(s)

  • class RegulatoryExtractor

    A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

    File: /tf/active/vicechatdev/reg_extractor.py

    pdf-extraction regulatory-documents llm-extraction ocr data-extraction
  • class FixedProjectVictoriaGenerator

    Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    class fixedprojectvictoriagenerator
  • class FileCloudAPI

    Python wrapper for the FileCloud REST API. This class provides methods to interact with FileCloud server APIs, handling authentication, session management, and various file operations.

    File: /tf/active/vicechatdev/FC_api copy.py

    class filecloudapi
  • class FileCloudAPI_v1

    Python wrapper for the FileCloud REST API. This class provides methods to interact with FileCloud server APIs, handling authentication, session management, and various file operations.

    File: /tf/active/vicechatdev/FC_api.py

    class filecloudapi
  • class DocumentProcessor_v5

    Process different document types for RAG context extraction

    File: /tf/active/vicechatdev/offline_docstore_multi_vice.py

    class documentprocessor
  • class DocumentDetail_v1

    Document detail view component

    File: /tf/active/vicechatdev/document_detail_old.py

    class documentdetail
  • function analyze_flock_type_patterns

    Analyzes and prints timing pattern statistics for flock data by categorizing issues that occur before start time and after end time, grouped by flock type.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    data-analysis pandas timing-patterns flock-management aggregation
  • class QueryBasedExtractor_v2

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    information-extraction text-processing llm openai query-based
  • class OneCo_hybrid_RAG_v2

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class oneco_hybrid_rag
  • class DocumentProcessor_v6

    Process different document types for RAG context extraction

    File: /tf/active/vicechatdev/offline_docstore_multi.py

    class documentprocessor
  • function test_mixed_previous_reports

    A test function that validates the DocumentExtractor's ability to extract text content from multiple file formats (TXT and Markdown) and combine them into a unified previous reports summary.

    File: /tf/active/vicechatdev/leexi/test_enhanced_reports.py

    testing document-extraction file-processing integration-test text-extraction
  • class PowerPointProcessor

    A class that processes PowerPoint (.pptx) presentations to extract text content and tables, converting tables to markdown format and organizing content by slides.

    File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

    powerpoint pptx document-processing text-extraction table-extraction
  • class EnhancedMeetingMinutesGenerator

    A class named EnhancedMeetingMinutesGenerator

    File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

    class enhancedmeetingminutesgenerator
  • class SharePointFileCloudSync

    Orchestrates synchronization of documents from SharePoint to FileCloud, managing the complete sync lifecycle including document retrieval, comparison, upload, and folder structure creation.

    File: /tf/active/vicechatdev/SPFCsync/sync_service.py

    synchronization sharepoint filecloud document-management cloud-sync
  • class QueryBasedExtractor

    A class that extracts relevant information from documents using a small LLM (Language Model), designed for Extensive and Full Reading modes in RAG systems.

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    information-extraction document-processing llm rag query-based
  • class DocChatRAG

    Main RAG engine with three operating modes: (1) Basic RAG (similarity search); (2) Extensive (full document retrieval with preprocessing); (3) Full Reading (process all documents).

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    class docchatrag
  • class DocumentProcessor_v4

    Handles document processing and text extraction using llmsherpa (same approach as offline_docstore_multi_vice.py).

    File: /tf/active/vicechatdev/docchat/document_processor.py

    class documentprocessor
  • class DocumentProcessor_v1

    A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

    File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_new.py

    document-processing text-extraction pdf-processing word-processing llmsherpa
  • class LLMClient_v2

    Client for interacting with LLM providers (OpenAI, Anthropic, Azure, etc.)

    File: /tf/active/vicechatdev/contract_validity_analyzer/utils/llm_client.py

    class llmclient
  • class DocumentProcessor_v2

    A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

    File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_old.py

    document-processing text-extraction pdf-processing word-processing llmsherpa
  • class TextClusterer

    A class that clusters similar documents based on their embeddings using various clustering algorithms (K-means, Agglomerative, DBSCAN) and optionally generates summaries for each cluster.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/clustering/text_clusterer.py

    clustering document-clustering embeddings machine-learning kmeans
  • function read_excel_file

    Reads Excel files and returns either metadata for all sheets or detailed data for a specific sheet, including format validation, European decimal conversion, and rich metadata extraction.

    File: /tf/active/vicechatdev/vice_ai/smartstat_service.py

    excel data-loading pandas file-io validation
  • class DocumentProcessor_v7

    Lightweight document processor for chat upload functionality

    File: /tf/active/vicechatdev/vice_ai/document_processor.py

    class documentprocessor
  • class LLMClient_v1

    Multi-LLM client that provides a unified interface for interacting with OpenAI GPT-4o, Azure OpenAI, Google Gemini, and Anthropic Claude models.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    llm openai azure gemini claude
  • function export_to_docx

    Exports a document with text and data sections to Microsoft Word DOCX format, preserving formatting, structure, and metadata.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    document-export docx word-document file-generation content-formatting
  • function export_to_pdf

    Exports a document with text and data sections to a PDF file using ReportLab, handling custom styling, section ordering, and content formatting including Quill Delta to HTML/Markdown conversion.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    pdf-export document-generation reportlab content-formatting quill-delta
  • class QueryBasedExtractor_v1

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    information-extraction llm openai text-processing query-based
  • class OneCo_hybrid_RAG_v3

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    class oneco_hybrid_rag
  • class ExtensiveSearchManager_v1

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    class extensivesearchmanager
  • class StatisticalAgent

    LLM-powered statistical analysis agent

    File: /tf/active/vicechatdev/vice_ai/statistical_agent.py

    class statisticalagent
  • class ControlledDocumentFlaskApp

    Main Flask application class for Controlled Document Management System.

    File: /tf/active/vicechatdev/CDocs/main_flask.py

    class controlleddocumentflaskapp
  • function get_user_permissions

    Retrieves all permissions for a user by aggregating permissions from their assigned roles, with fallback to default USER role permissions.

    File: /tf/active/vicechatdev/CDocs/config/permissions.py

    authorization permissions rbac role-based-access-control security
  • function manage_document_permissions

    Comprehensive function to manage document sharing and user permissions. It creates a share only if needed for active users; adds or updates users with appropriate permissions based on their roles; removes users who should no longer have access; cleans up shares that are no longer needed; and manages ACL entries for write permissions on the document's folder. Takes the document whose permissions are to be managed and returns a dict with detailed results of the permission updates.

    File: /tf/active/vicechatdev/CDocs/controllers/share_controller.py

    function manage_document_permissions
  • class PDFManipulator

    Manipulates existing PDF documents. This class provides methods to add watermarks, merge PDFs, extract pages, and perform other manipulation operations.

    File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

    class pdfmanipulator
  • function merge_pdfs

    Merges multiple PDF files into a single consolidated PDF document by delegating to a PDFManipulator instance.

    File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

    pdf merge combine document-processing file-manipulation
  • class VersionComparisonService

    A service class that compares two versions of a document using LLM-based analysis, implementing smart segmentation and chunking for handling large documents efficiently.

    File: /tf/active/vicechatdev/CDocs/utils/version_comparison.py

    document-comparison version-control llm openai text-analysis
  • function check_document_permissions_on_startup

    Validates and fixes document permission issues during application startup, prioritizing active documents (DRAFT, IN_REVIEW, IN_APPROVAL) to ensure proper sharing permissions are configured.

    File: /tf/active/vicechatdev/CDocs/utils/sharing_validator.py

    startup initialization document-permissions validation permissions
  • class TrainingCompletion

    UI component for completing training requirements.

    File: /tf/active/vicechatdev/CDocs/ui/training_completion.py

    class trainingcompletion
  • class TrainingManagement

    UI component for managing document training.

    File: /tf/active/vicechatdev/CDocs/ui/training_management.py

    class trainingmanagement
  • class DocumentDetail_v2

    Document detail view component

    File: /tf/active/vicechatdev/CDocs/ui/document_detail.py

    class documentdetail
  • class UserTasksPanel

    Panel showing pending tasks for the current user

    File: /tf/active/vicechatdev/CDocs/ui/user_tasks_panel.py

    class usertaskspanel
  • class ApprovalPanel

    Approval management interface component

    File: /tf/active/vicechatdev/CDocs/ui/approval_panel_bis.py

    class approvalpanel
  • class TrainingDashboard

    Training dashboard for users to view and complete training requirements.

    File: /tf/active/vicechatdev/CDocs/ui/training_dashboard.py

    class trainingdashboard
  • class EnhancedSQLWorkflow

    Enhanced SQL workflow with iterative optimization

    File: /tf/active/vicechatdev/full_smartstat/enhanced_sql_workflow.py

    class enhancedsqlworkflow
  • class VendorEmailExtractor

    Extract vendor email addresses from all organizational mailboxes

    File: /tf/active/vicechatdev/find_email/vendor_email_extractor.py

    class vendoremailextractor
  • function max_range

    Computes the maximal lower and upper bounds from a list of range tuples, handling various data types including numeric, datetime, and string values.

    File: /tf/active/vicechatdev/patches/util.py

    data-processing range-computation bounds aggregation datetime-handling
  • function mimebundle_to_html

    Converts a MIME bundle (dictionary or tuple of data and metadata) into HTML string representation, including any embedded JavaScript.

    File: /tf/active/vicechatdev/patches/util.py

    mime-bundle html-conversion jupyter rich-display javascript
  • class HoloMap

    HoloMap is an n-dimensional mapping container that stores viewable elements or overlays indexed by tuple keys along declared key dimensions, enabling interactive exploration through widgets.

    File: /tf/active/vicechatdev/patches/spaces.py

    visualization mapping multi-dimensional interactive container
  • class GoogleSearchClient

    A client class for performing Google searches using the Serper API through LangChain's GoogleSerperAPIWrapper, providing both single and batch search capabilities.

    File: /tf/active/vicechatdev/QA_updater/data_access/google_search_client.py

    google-search serper-api web-search langchain api-client
  • class DocumentProcessor_v3

    A comprehensive PDF document processor that handles text extraction, OCR (Optical Character Recognition), layout analysis, table detection, and metadata extraction from PDF files.

    File: /tf/active/vicechatdev/invoice_extraction/core/document_processor.py

    pdf-processing ocr text-extraction document-processing invoice-processing
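
To make one of the listed entries concrete, here is a minimal sketch of what the `max_range` description states: take a list of (lower, upper) tuples and return the overall bounds. It is written from the one-line description only and is not the code in `patches/util.py`; the name `max_range_sketch` and the choice to skip `None` bounds are assumptions.

```python
from datetime import datetime


def max_range_sketch(ranges):
    """Return (smallest lower bound, largest upper bound) across all ranges.

    Works for any mutually comparable values (numbers, datetimes, strings).
    Skipping None bounds is an assumption of this sketch.
    """
    lowers = [lo for lo, _ in ranges if lo is not None]
    uppers = [hi for _, hi in ranges if hi is not None]
    if not lowers or not uppers:
        return (None, None)
    return (min(lowers), max(uppers))


print(max_range_sketch([(0, 5), (-2, 3), (1, 10)]))  # (-2, 10)
print(max_range_sketch([("b", "d"), ("a", "c")]))    # ('a', 'd')
print(max_range_sketch([
    (datetime(2024, 1, 1), datetime(2024, 3, 1)),
    (datetime(2023, 12, 1), datetime(2024, 2, 1)),
]))
```

Mixing types within one call (for example numbers and strings) would raise a `TypeError` in this sketch, since Python cannot compare them.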

Search Examples