🔍 Code Extractor

Browse Components

Showing 20 of 1729 components

  • class TestBEExtractor

    Unit test class for testing the BEExtractor class, which extracts structured data from Belgian invoices using LLM-based extraction.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py | Lines: 305-474

    unit-testing invoice-extraction belgian-invoices llm-testing data-extraction
  • class TestUKExtractor

    Unit test class for testing the UKExtractor class, which extracts structured data from UK invoices including VAT numbers, dates, amounts, and line items.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py | Lines: 135-302

    unit-test invoice-extraction uk-invoices vat-validation date-parsing
  • class MockLLMClient

    A mock implementation of an LLM client designed for testing extractor components without making actual API calls to language models.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py | Lines: 115-132

    testing mock llm unit-test test-double
  • class TestBaseExtractor

    Unit test class for testing the BaseExtractor class, which provides comprehensive test coverage for document extraction functionality including initialization, structure extraction, bounding box text retrieval, and confidence calculation.

    File: /tf/active/vicechatdev/invoice_extraction/tests/test_extractors.py | Lines: 17-112

    unit-testing document-extraction test-case base-extractor confidence-calculation
  • class EntityClassifier

    Classifies which ViceBio entity (UK, Belgium, or Australia) an invoice is addressed to using rule-based pattern matching and LLM fallback.

    File: /tf/active/vicechatdev/invoice_extraction/core/entity_classifier.py | Lines: 7-199

    classification entity-recognition invoice-processing pattern-matching regex
  • class ExcelGenerator

    Generates structured Excel files from extracted invoice data.

    File: /tf/active/vicechatdev/invoice_extraction/core/excel_generator.py | Lines: 11-502

    class excelgenerator
  • class LanguageDetector

    A language detection class that identifies whether invoice documents are written in English, French, or Dutch using both rule-based keyword matching and LLM-based detection.

    File: /tf/active/vicechatdev/invoice_extraction/core/language_detector.py | Lines: 10-236

    language-detection nlp invoice-processing text-analysis multilingual
  • class DocumentProcessor_v3

    A comprehensive PDF document processor that handles text extraction, OCR (Optical Character Recognition), layout analysis, table detection, and metadata extraction from PDF files.

    File: /tf/active/vicechatdev/invoice_extraction/core/document_processor.py | Lines: 12-451

    pdf-processing ocr text-extraction document-processing invoice-processing
  • class DocumentProcessor_v8

    Process different document types for indexing

    File: /tf/active/vicechatdev/docchat/document_indexer.py | Lines: 230-787

    class documentprocessor
  • class DocumentProcessor_v7

    Lightweight document processor for chat upload functionality

    File: /tf/active/vicechatdev/vice_ai/document_processor.py | Lines: 97-1028

    class documentprocessor
  • class DocumentProcessor_v6

    Process different document types for RAG context extraction

    File: /tf/active/vicechatdev/offline_docstore_multi.py | Lines: 189-1177

    class documentprocessor
  • class DocumentProcessor_v5

    Process different document types for RAG context extraction

    File: /tf/active/vicechatdev/offline_docstore_multi_vice.py | Lines: 197-1216

    class documentprocessor
  • class DocumentProcessor_v4

    Handles document processing and text extraction using llmsherpa (same approach as offline_docstore_multi_vice.py).

    File: /tf/active/vicechatdev/docchat/document_processor.py | Lines: 44-611

    class documentprocessor
  • function main_v4

    Main entry point function for an invoice processing system that monitors an inbound directory for PDF invoices, processes them using LLM extraction, generates Excel outputs, and moves processed files to a processed directory.

    File: /tf/active/vicechatdev/invoice_extraction/main.py | Lines: 193-270

    invoice-processing pdf-processing llm-extraction batch-processing file-monitoring
  • function main_v28

    Command-line entry point that parses arguments and orchestrates the extraction of vendor emails from all vicebio.com mailboxes using Microsoft Graph API.

    File: /tf/active/vicechatdev/find_email/extract_vendor_batch.py | Lines: 118-172

    cli command-line entry-point argparse email-extraction
  • function main_v27

    Demonstrates example usage of the VendorEmailExtractor class by searching for vendor emails across Office 365 mailboxes and displaying results.

    File: /tf/active/vicechatdev/find_email/vendor_email_extractor.py | Lines: 948-1000

    example demo vendor-email-extraction office365 microsoft-graph
  • function main_v26

    Orchestrates a complete correlation analysis pipeline for Eimeria infection and broiler performance data, from data loading through visualization and results export.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py | Lines: 489-546

    data-analysis correlation statistics veterinary agriculture
  • function main_v25

    Orchestrates and executes a comprehensive test suite for the Vice AI Data Analysis Integration, running multiple test functions, creating test datasets, and providing detailed pass/fail reporting.

    File: /tf/active/vicechatdev/vice_ai/test_integration.py | Lines: 154-210

    testing integration-tests test-runner data-analysis test-orchestration
  • function main_v24

    Interactive CLI function that allows users to select and run document processing test scenarios with varying document counts, providing feedback on test success and next steps.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_real_documents.py | Lines: 166-209

    cli interactive testing document-processing user-input
  • function main_v23

    Main test function that validates SharePoint folder structure connectivity and configuration, comparing actual folders against expected structure.

    File: /tf/active/vicechatdev/SPFCsync/test_folder_structure.py | Lines: 215-258

    testing sharepoint integration-test folder-structure validation