🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "extractor"

Found 40 matching component(s)

  • class PatternBasedExtractor

    Extract flocks based on farm-level In-Ovo usage patterns.

    File: /tf/active/vicechatdev/pattern_based_extraction.py

    class patternbasedextractor
  • function main_v5

    Command-line interface function that orchestrates pattern-based extraction of poultry flock data, including data loading, pattern classification, geocoding, and export functionality.

    File: /tf/active/vicechatdev/pattern_based_extraction.py

    cli command-line-interface data-extraction poultry-data pattern-analysis
  • class QueryBasedExtractor_v2

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    information-extraction text-processing llm openai query-based
  • class OneCo_hybrid_RAG_v2

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class oneco_hybrid_rag
  • function test_mixed_previous_reports

    A test function that validates the DocumentExtractor's ability to extract text content from multiple file formats (TXT and Markdown) and combine them into a unified previous reports summary.

    File: /tf/active/vicechatdev/leexi/test_enhanced_reports.py

    testing document-extraction file-processing integration-test text-extraction
  • class DocumentExtractor

    A document text extraction class that supports multiple file formats including Word, PowerPoint, PDF, and plain text files, with automatic format detection and conversion capabilities.

    File: /tf/active/vicechatdev/leexi/document_extractor.py

    document-processing text-extraction pdf word powerpoint
  • function test_document_extractor

    A test function that validates the DocumentExtractor class by testing file type support detection, text extraction from various document formats, and error handling.

    File: /tf/active/vicechatdev/leexi/test_document_extractor.py

    testing document-extraction file-processing validation text-extraction
  • function extract_previous_reports_summary

    Extracts and summarizes key information from previous meeting report files using document extraction and OpenAI's GPT-4o-mini model to provide context for upcoming meetings.

    File: /tf/active/vicechatdev/leexi/app.py

    meeting-analysis document-extraction text-summarization llm openai
  • function test_multiple_files

    A test function that validates the extraction of text content from multiple document files using a DocumentExtractor instance, displaying extraction results and simulating combined content processing.

    File: /tf/active/vicechatdev/leexi/test_multiple_files.py

    testing document-extraction file-processing text-extraction multiple-files
  • class QueryBasedExtractor

    A class that extracts relevant information from documents using a small LLM (Language Model), designed for Extensive and Full Reading modes in RAG systems.

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    information-extraction document-processing llm rag query-based
  • class DocChatRAG

    Main RAG engine with three operating modes: 1. Basic RAG (similarity search) 2. Extensive (full document retrieval with preprocessing) 3. Full Reading (process all documents)

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    class docchatrag
  • class ContractDataExtractor

    Extract structured data from legal contracts using LLM analysis

    File: /tf/active/vicechatdev/contract_validity_analyzer/extractor.py

    class contractdataextractor
  • function parse_arguments_v1

    Parses command-line arguments for a legal contract extraction tool that processes documents from FileCloud storage.

    File: /tf/active/vicechatdev/contract_validity_analyzer/extractor.py

    cli command-line argument-parsing argparse configuration
  • function main

    Main entry point function for a Legal Contract Data Extractor application that processes contracts from FileCloud, extracts data, and exports results to multiple formats (CSV, Excel, JSON).

    File: /tf/active/vicechatdev/contract_validity_analyzer/extractor.py

    main-entry-point contract-extraction data-processing filecloud csv-export
  • function test_llm_extraction

    A test function that validates LLM-based contract data extraction by processing a sample contract and verifying the extracted fields against expected values.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_extractor.py

    testing contract-extraction llm validation integration-test
  • class ContractAnalyzer

    Main class for analyzing contract validity from FileCloud documents.

    File: /tf/active/vicechatdev/contract_validity_analyzer/core/analyzer.py

    class contractanalyzer
  • class QueryBasedExtractor_v1

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    information-extraction llm openai text-processing query-based
  • class OneCo_hybrid_RAG_v3

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/vice_ai/hybrid_rag_engine.py

    class oneco_hybrid_rag
  • class PDFTextExtractor

    A class for extracting text, images, and structured content from PDF documents with layout preservation capabilities.

    File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

    pdf text-extraction document-processing layout-analysis markdown-conversion
  • function extract_text_from_pdf

    Extracts all text content from a PDF document and returns it as a string.

    File: /tf/active/vicechatdev/CDocs/utils/pdf_utils.py

    pdf text-extraction document-processing file-io pdf-parsing
  • function main_v27

    Demonstrates example usage of the VendorEmailExtractor class by searching for vendor emails across Office 365 mailboxes and displaying results.

    File: /tf/active/vicechatdev/find_email/vendor_email_extractor.py

    example demo vendor-email-extraction office365 microsoft-graph
  • function extract_batch

    Batch processes a list of vendors from an Excel file to extract their email addresses by searching through Microsoft 365 mailboxes using AI-powered email analysis.

    File: /tf/active/vicechatdev/find_email/extract_vendor_batch.py

    email-extraction vendor-management batch-processing microsoft-graph-api openai
  • function test_authentication

    Tests Azure AD authentication for a VendorEmailExtractor instance by attempting to authenticate and providing detailed troubleshooting guidance on failure.

    File: /tf/active/vicechatdev/find_email/test_vendor_extractor.py

    authentication azure-ad testing diagnostics troubleshooting
  • function test_mailbox_access

    Tests the ability to access and retrieve mailboxes from Microsoft Graph API through a VendorEmailExtractor instance, displaying results and troubleshooting information.

    File: /tf/active/vicechatdev/find_email/test_vendor_extractor.py

    testing microsoft-graph mailbox diagnostics permissions
  • function test_email_search

    Tests the email search functionality of a VendorEmailExtractor instance by searching for emails containing common business terms in the first available mailbox.

    File: /tf/active/vicechatdev/find_email/test_vendor_extractor.py

    testing email search microsoft-graph mailbox
  • function test_llm_connectivity

    Tests the connectivity and functionality of an OpenAI LLM integration by analyzing a mock email with vendor information extraction.

    File: /tf/active/vicechatdev/find_email/test_vendor_extractor.py

    testing llm openai connectivity diagnostic
  • function run_all_tests

    Orchestrates a comprehensive test suite for the Vendor Email Extractor system, verifying configuration, authentication, mailbox access, email search, and LLM connectivity.

    File: /tf/active/vicechatdev/find_email/test_vendor_extractor.py

    testing verification setup validation integration-test
  • class Config_v5

    A hierarchical configuration manager that loads and manages settings from multiple sources (defaults, files, environment variables) with support for nested structures and dynamic updates.

    File: /tf/active/vicechatdev/invoice_extraction/config.py

    configuration settings config-management yaml json
  • class InvoiceProcessor

    Main orchestrator class that coordinates the complete invoice processing pipeline from PDF extraction through validation to Excel generation.

    File: /tf/active/vicechatdev/invoice_extraction/main.py

    invoice-processing document-processing pdf-extraction entity-classification language-detection
  • class TestBaseExtractor

    Unit test class for testing the BaseExtractor class, which provides comprehensive test coverage for document extraction functionality including initialization, structure extraction, bounding box text retrieval, and confidence calculation.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py

    unit-testing document-extraction test-case base-extractor confidence-calculation
  • class MockLLMClient

    A mock implementation of an LLM client designed for testing extractor components without making actual API calls to language models.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py

    testing mock llm unit-test test-double
  • class TestUKExtractor

    Unit test class for testing the UKExtractor class, which extracts structured data from UK invoices including VAT numbers, dates, amounts, and line items.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py

    unit-test invoice-extraction uk-invoices vat-validation date-parsing
  • class TestBEExtractor

    Unit test class for testing the BEExtractor class, which extracts structured data from Belgian invoices using LLM-based extraction.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py

    unit-testing invoice-extraction belgian-invoices llm-testing data-extraction
  • class TestAUExtractor

    Unit test class for testing the AUExtractor class, which extracts data from Australian invoices including ABN, GST, and payment details.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py

    unit-test invoice-extraction australian-invoices ABN-formatting GST-handling
  • class BEExtractor

    Belgium-specific invoice data extractor that uses LLM (Large Language Model) to extract structured invoice data from Belgian invoices in multiple languages (English, French, Dutch).

    File: /tf/active/vicechatdev/invoice_extraction/extractors/be_extractor.py

    invoice-extraction belgium llm ocr document-processing
  • class AUExtractor

    Australia-specific invoice data extractor that uses LLM (Large Language Model) to extract structured invoice data from Australian tax invoices, handling ABN, ACN, GST, BSB numbers and Australian date formats.

    File: /tf/active/vicechatdev/invoice_extraction/extractors/au_extractor.py

    invoice-extraction australia llm ocr document-processing
  • class BaseExtractor

    Abstract base class that defines the interface and shared functionality for entity-specific invoice data extractors (UK, BE, AU), providing a multi-stage extraction pipeline for invoice processing.

    File: /tf/active/vicechatdev/invoice_extraction/extractors/base_extractor.py

    invoice-processing data-extraction abstract-base-class OCR document-processing
  • class UKExtractor

    UK-specific invoice data extractor.

    File: /tf/active/vicechatdev/invoice_extraction/extractors/uk_extractor.py

    class ukextractor
  • class OneCo_hybrid_RAG_v4

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/datacapture_backup_16072025/OneCo_hybrid_RAG.py

    class oneco_hybrid_rag
  • class OneCo_hybrid_RAG_v5

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/data_capture_backup_18072025/OneCo_hybrid_RAG.py

    class oneco_hybrid_rag

Search Examples