Search - Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "combine"

Found 50 matching component(s)

class RegulatoryExtractor

A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

File: /tf/active/vicechatdev/reg_extractor.py

pdf-extraction regulatory-documents llm-extraction ocr data-extraction
class FixedProjectVictoriaGenerator

Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

class fixedprojectvictoriagenerator
class FileCloudAPI

Python wrapper for the FileCloud REST API. This class provides methods to interact with FileCloud server APIs, handling authentication, session management, and various file operations.

File: /tf/active/vicechatdev/FC_api copy.py

class filecloudapi
class FileCloudAPI_v1

Python wrapper for the FileCloud REST API. This class provides methods to interact with FileCloud server APIs, handling authentication, session management, and various file operations.

File: /tf/active/vicechatdev/FC_api.py

class filecloudapi
class DocumentProcessor_v5

Process different document types for RAG context extraction

File: /tf/active/vicechatdev/offline_docstore_multi_vice.py

class documentprocessor
class DocumentDetail_v1

Document detail view component

File: /tf/active/vicechatdev/document_detail_old.py

class documentdetail
function analyze_flock_type_patterns

Analyzes and prints timing pattern statistics for flock data by categorizing issues that occur before start time and after end time, grouped by flock type.

File: /tf/active/vicechatdev/data_quality_dashboard.py

data-analysis pandas timing-patterns flock-management aggregation
class QueryBasedExtractor_v2

A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

information-extraction text-processing llm openai query-based
class OneCo_hybrid_RAG_v2

A class named OneCo_hybrid_RAG

File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

class oneco_hybrid_rag
class DocumentProcessor_v6

Process different document types for RAG context extraction

File: /tf/active/vicechatdev/offline_docstore_multi.py

class documentprocessor
function test_mixed_previous_reports

A test function that validates the DocumentExtractor's ability to extract text content from multiple file formats (TXT and Markdown) and combine them into a unified previous reports summary.

File: /tf/active/vicechatdev/leexi/test_enhanced_reports.py

testing document-extraction file-processing integration-test text-extraction
class PowerPointProcessor

A class that processes PowerPoint (.pptx) presentations to extract text content and tables, converting tables to markdown format and organizing content by slides.

File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

powerpoint pptx document-processing text-extraction table-extraction
class EnhancedMeetingMinutesGenerator

A class named EnhancedMeetingMinutesGenerator

File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

class enhancedmeetingminutesgenerator
class SharePointFileCloudSync

Orchestrates synchronization of documents from SharePoint to FileCloud, managing the complete sync lifecycle including document retrieval, comparison, upload, and folder structure creation.

File: /tf/active/vicechatdev/SPFCsync/sync_service.py

synchronization sharepoint filecloud document-management cloud-sync
class QueryBasedExtractor

A class that extracts relevant information from documents using a small LLM (Language Model), designed for Extensive and Full Reading modes in RAG systems.

File: /tf/active/vicechatdev/docchat/rag_engine.py

information-extraction document-processing llm rag query-based
class DocChatRAG

Main RAG engine with three operating modes: 1. Basic RAG (similarity search) 2. Extensive (full document retrieval with preprocessing) 3. Full Reading (process all documents)

File: /tf/active/vicechatdev/docchat/rag_engine.py

class docchatrag
class DocumentProcessor_v4

Handles document processing and text extraction using llmsherpa (same approach as offline_docstore_multi_vice.py).

File: /tf/active/vicechatdev/docchat/document_processor.py

class documentprocessor
class DocumentProcessor_v1

A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_new.py

document-processing text-extraction pdf-processing word-processing llmsherpa
class LLMClient_v2

Client for interacting with LLM providers (OpenAI, Anthropic, Azure, etc.)

File: /tf/active/vicechatdev/contract_validity_analyzer/utils/llm_client.py

class llmclient
class DocumentProcessor_v2

A document processing class that extracts text from PDF and Word documents using llmsherpa as the primary method with fallback support for PyPDF2, pdfplumber, and python-docx.

File: /tf/active/vicechatdev/contract_validity_analyzer/utils/document_processor_old.py

document-processing text-extraction pdf-processing word-processing llmsherpa
class TextClusterer

A class that clusters similar documents based on their embeddings using various clustering algorithms (K-means, Agglomerative, DBSCAN) and optionally generates summaries for each cluster.

File: /tf/active/vicechatdev/chromadb-cleanup/src/clustering/text_clusterer.py

clustering document-clustering embeddings machine-learning kmeans
function read_excel_file

Reads Excel files and returns either metadata for all sheets or detailed data for a specific sheet, including format validation, European decimal conversion, and rich metadata extraction.

File: /tf/active/vicechatdev/vice_ai/smartstat_service.py

excel data-loading pandas file-io validation
class DocumentProcessor_v7

Lightweight document processor for chat upload functionality

File: /tf/active/vicechatdev/vice_ai/document_processor.py

class documentprocessor
class LLMClient_v1

Multi-LLM client that provides a unified interface for interacting with OpenAI GPT-4o, Azure OpenAI, Google Gemini, and Anthropic Claude models.

File: /tf/active/vicechatdev/vice_ai/new_app.py

llm openai azure gemini claude
function export_to_docx

Exports a document with text and data sections to Microsoft Word DOCX format, preserving formatting, structure, and metadata.

File: /tf/active/vicechatdev/vice_ai/new_app.py

document-export docx word-document file-generation content-formatting
function export_to_pdf

Exports a document with text and data sections to a PDF file using ReportLab, handling custom styling, section ordering, and content formatting including Quill Delta to HTML/Markdown conversion.

File: /tf/active/vicechatdev/vice_ai/new_app.py

pdf-export document-generation reportlab content-formatting quill-delta
class QueryBasedExtractor_v1

A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

information-extraction llm openai text-processing query-based
class OneCo_hybrid_RAG_v3

A class named OneCo_hybrid_RAG

File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

class oneco_hybrid_rag
class ExtensiveSearchManager_v1

Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

class extensivesearchmanager
class StatisticalAgent

LLM-powered statistical analysis agent

File: /tf/active/vicechatdev/vice_ai/statistical_agent.py

class statisticalagent
class ControlledDocumentFlaskApp

Main Flask application class for Controlled Document Management System.

File: /tf/active/vicechatdev/CDocs/main_flask.py

class controlleddocumentflaskapp
function get_user_permissions

Retrieves all permissions for a user by aggregating permissions from their assigned roles, with fallback to default USER role permissions.

File: /tf/active/vicechatdev/CDocs/config/permissions.py

authorization permissions rbac role-based-access-control security
function manage_document_permissions

Comprehensive function to manage document sharing and user permissions. This function: 1. Creates a share only if needed for active users 2. Adds/updates users with appropriate permissions based on their roles 3. Removes users who shouldn't have access anymore 4. Cleans up shares that are no longer needed 5. Manages ACL entries for write permissions on the document's folder Args: document: The document to manage permissions for Returns: Dict: Result of permission updates with detailed information

File: /tf/active/vicechatdev/CDocs/controllers/share_controller.py

function manage_document_permissions
class PDFManipulator

Manipulates existing PDF documents This class provides methods to add watermarks, merge PDFs, extract pages, and perform other manipulation operations.

File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

class pdfmanipulator
function merge_pdfs

Merges multiple PDF files into a single consolidated PDF document by delegating to a PDFManipulator instance.

File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

pdf merge combine document-processing file-manipulation
class VersionComparisonService

A service class that compares two versions of a document using LLM-based analysis, implementing smart segmentation and chunking for handling large documents efficiently.

File: /tf/active/vicechatdev/CDocs/utils/version_comparison.py

document-comparison version-control llm openai text-analysis
function check_document_permissions_on_startup

Validates and fixes document permission issues during application startup, prioritizing active documents (DRAFT, IN_REVIEW, IN_APPROVAL) to ensure proper sharing permissions are configured.

File: /tf/active/vicechatdev/CDocs/utils/sharing_validator.py

startup initialization document-permissions validation permissions
class TrainingCompletion

UI component for completing training requirements.

File: /tf/active/vicechatdev/CDocs/ui/training_completion.py

class trainingcompletion
class TrainingManagement

UI component for managing document training.

File: /tf/active/vicechatdev/CDocs/ui/training_management.py

class trainingmanagement
class DocumentDetail_v2

Document detail view component

File: /tf/active/vicechatdev/CDocs/ui/document_detail.py

class documentdetail
class UserTasksPanel

Panel showing pending tasks for the current user

File: /tf/active/vicechatdev/CDocs/ui/user_tasks_panel.py

class usertaskspanel
class ApprovalPanel_v1

Approval management interface component

File: /tf/active/vicechatdev/CDocs/ui/approval_panel_bis.py

class approvalpanel
class TrainingDashboard

Training dashboard for users to view and complete training requirements.

File: /tf/active/vicechatdev/CDocs/ui/training_dashboard.py

class trainingdashboard
class EnhancedSQLWorkflow

Enhanced SQL workflow with iterative optimization

File: /tf/active/vicechatdev/full_smartstat/enhanced_sql_workflow.py

class enhancedsqlworkflow
class VendorEmailExtractor

Extract vendor email addresses from all organizational mailboxes

File: /tf/active/vicechatdev/find_email/vendor_email_extractor.py

class vendoremailextractor
function max_range

Computes the maximal lower and upper bounds from a list of range tuples, handling various data types including numeric, datetime, and string values.

File: /tf/active/vicechatdev/patches/util.py

data-processing range-computation bounds aggregation datetime-handling
function mimebundle_to_html

Converts a MIME bundle (dictionary or tuple of data and metadata) into HTML string representation, including any embedded JavaScript.

File: /tf/active/vicechatdev/patches/util.py

mime-bundle html-conversion jupyter rich-display javascript
class HoloMap

HoloMap is an n-dimensional mapping container that stores viewable elements or overlays indexed by tuple keys along declared key dimensions, enabling interactive exploration through widgets.

File: /tf/active/vicechatdev/patches/spaces.py

visualization mapping multi-dimensional interactive container
class GoogleSearchClient

A client class for performing Google searches using the Serper API through LangChain's GoogleSerperAPIWrapper, providing both single and batch search capabilities.

File: /tf/active/vicechatdev/QA_updater/data_access/google_search_client.py

google-search serper-api web-search langchain api-client
class DocumentProcessor_v3

A comprehensive PDF document processor that handles text extraction, OCR (Optical Character Recognition), layout analysis, table detection, and metadata extraction from PDF files.

File: /tf/active/vicechatdev/invoice_extraction/core/document_processor.py

pdf-processing ocr text-extraction document-processing invoice-processing

Search Examples

validation - Find validation functions
database - Find database-related components
email - Find email processing functions
api - Find API-related components
authentication - Find auth components

Search Components

Search Results for "combine"

class RegulatoryExtractor

class FixedProjectVictoriaGenerator

class FileCloudAPI

class FileCloudAPI_v1

class DocumentProcessor_v5

class DocumentDetail_v1

function analyze_flock_type_patterns

class QueryBasedExtractor_v2

class OneCo_hybrid_RAG_v2

class DocumentProcessor_v6

function test_mixed_previous_reports

class PowerPointProcessor

class EnhancedMeetingMinutesGenerator

class SharePointFileCloudSync

class QueryBasedExtractor

class DocChatRAG

class DocumentProcessor_v4

class DocumentProcessor_v1

class LLMClient_v2

class DocumentProcessor_v2

class TextClusterer

function read_excel_file

class DocumentProcessor_v7

class LLMClient_v1

function export_to_docx

function export_to_pdf

class QueryBasedExtractor_v1

class OneCo_hybrid_RAG_v3

class ExtensiveSearchManager_v1

class StatisticalAgent

class ControlledDocumentFlaskApp

function get_user_permissions

function manage_document_permissions

class PDFManipulator

function merge_pdfs

class VersionComparisonService

function check_document_permissions_on_startup

class TrainingCompletion

class TrainingManagement

class DocumentDetail_v2

class UserTasksPanel

class ApprovalPanel_v1

class TrainingDashboard

class EnhancedSQLWorkflow

class VendorEmailExtractor

function max_range

function mimebundle_to_html

class HoloMap

class GoogleSearchClient

class DocumentProcessor_v3

Search Examples