🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "pipeline"

Found 50 matching component(s)

  • function main_v17

    Converts a markdown file containing warranty disclosure data into multiple tabular formats (CSV, Excel, Word) with timestamped output files.

    File: /tf/active/vicechatdev/convert_disclosures_to_table.py

    markdown-conversion data-extraction report-generation csv-export excel-export
  • function main_v1

    Main orchestration function that reads an improved markdown file and converts it to an enhanced Word document with comprehensive formatting, including table of contents, warranty sections, disclosures, and bibliography.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    document-generation word-processing markdown-conversion docx file-processing
  • class MyEmbeddingFunction_v1

    A custom embedding function class that generates embeddings for documents using OpenAI's API, with built-in text summarization for long documents and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    embeddings openai chromadb vector-database text-summarization
  • class SimpleDataHandle

    A data handler class that manages multiple data sources with different types (dataframes, vector stores, databases) and their associated processing configurations.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    data-management registry vector-store RAG dataframe
  • class FixedProjectVictoriaGenerator

    Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    class fixedprojectvictoriagenerator
  • function main_v59

    Entry point function that instantiates a FixedProjectVictoriaGenerator and executes its complete pipeline to generate fixed disclosure documents.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    entry-point pipeline disclosure-generation orchestration main-function
  • class ImprovedProjectVictoriaGenerator

    Improved Project Victoria Disclosure Generator with proper reference management.

    File: /tf/active/vicechatdev/improved_project_victoria_generator.py

    class improvedprojectvictoriagenerator
  • function main_v63

    Entry point function that instantiates an ImprovedProjectVictoriaGenerator and executes its complete pipeline to generate disclosure documents.

    File: /tf/active/vicechatdev/improved_project_victoria_generator.py

    entry-point main-function disclosure-generation RAG document-generation
  • class ProjectVictoriaDisclosureGenerator

    Main class for generating Project Victoria disclosures from warranty claims.

    File: /tf/active/vicechatdev/project_victoria_disclosure_generator.py

    class projectvictoriadisclosuregenerator
  • function test_attendee_extraction_comprehensive

    A comprehensive test function that validates the attendee extraction logic from meeting transcripts, comparing actual speakers versus mentioned names, and demonstrating integration with meeting minutes generation.

    File: /tf/active/vicechatdev/leexi/test_attendee_comprehensive.py

    testing attendee-extraction meeting-minutes transcript-parsing speaker-identification
  • function test_document_processor

    A test function that validates the DocumentProcessor component's ability to extract text from PDF files with improved error handling and llmsherpa integration.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_improved_processor.py

    testing document-processing pdf-extraction text-extraction integration-test
  • function test_local_document

    Integration test function that validates end date extraction from a local PDF document using document processing and LLM-based analysis.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_local_document.py

    testing integration-test document-processing pdf-extraction llm
  • function test_with_simulated_content

    Tests LLM-based contract analysis prompts using simulated NDA content containing a term clause to verify extraction of contract dates and metadata.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_local_document.py

    testing contract-analysis llm nda date-extraction
  • function test_single_document

    Tests end date extraction from a specific PDF document by downloading it from FileCloud, extracting text, and using LLM-based analysis to identify contract expiry dates.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_single_document.py

    testing integration-test document-processing pdf-extraction contract-analysis
  • function test_full_analyzer

    Tests the full contract analyzer pipeline by running a dry-run analysis on a limited number of files to verify the system is working correctly.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_implementation.py

    testing integration-test contract-analysis pipeline-validation dry-run
  • class ContractDataExtractor

    Extract structured data from legal contracts using LLM analysis

    File: /tf/active/vicechatdev/contract_validity_analyzer/extractor.py

    class contractdataextractor
  • function test_with_real_documents

    Tests a contract analyzer system by processing real documents from FileCloud, extracting contract information, and generating analysis reports with performance metrics.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_real_documents.py

    testing integration-test contract-analysis document-processing filecloud
  • function main_v6

    Main entry point function for the Contract Validity Analyzer application that orchestrates configuration loading, logging setup, FileCloud connection, and contract analysis execution.

    File: /tf/active/vicechatdev/contract_validity_analyzer/main.py

    entry-point main-function cli-application contract-analysis filecloud
  • function test_ocr_retry_logic

    Tests the OCR retry logic for extracting contract end dates by first attempting normal text extraction, then falling back to OCR-based extraction if the end date is not found.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_ocr_retry.py

    testing ocr document-processing pdf-extraction contract-analysis
  • function test_simulated_document

    Integration test function that validates end date extraction from a simulated contract document containing an explicit term clause, using a two-step LLM-based analysis process.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_simulated_document.py

    testing integration-test contract-analysis date-extraction llm-client
  • function test_llm_extraction

    A test function that validates LLM-based contract data extraction by processing a sample contract and verifying the extracted fields against expected values.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_extractor.py

    testing contract-extraction llm validation integration-test
  • function main_v7

    Main entry point function that orchestrates the contract validity analysis workflow by loading configuration, setting up logging, initializing the analyzer, running analysis, and reporting results.

    File: /tf/active/vicechatdev/contract_validity_analyzer/core/analyzer.py

    entry-point orchestration contract-analysis workflow configuration
  • function main_v50

    Entry point function that demonstrates document processing workflow by creating an audited, watermarked, and protected PDF/A document from a DOCX file with audit trail data.

    File: /tf/active/vicechatdev/document_auditor/main.py

    document-processing pdf-generation audit-trail watermarking pdf-a-compliance
  • function main_v60

    Command-line interface function that orchestrates the cleaning of ChromaDB collections by removing duplicates and similar documents, with options to skip collections and customize the cleaning process.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    cli command-line chromadb database-cleaning deduplication
  • function load_data_from_chromadb

    Connects to a ChromaDB instance and retrieves all documents from a specified collection, returning them as a list of dictionaries with document IDs, text content, embeddings, and metadata.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    chromadb vector-database data-loading document-retrieval embeddings
  • function main_v51

    Command-line interface function that orchestrates a ChromaDB collection cleaning pipeline by removing duplicate and similar documents through hashing and similarity screening.

    File: /tf/active/vicechatdev/chromadb-cleanup/main copy.py

    cli command-line data-cleaning deduplication chromadb
  • class CombinedCleaner

    A document cleaner that combines hash-based and similarity-based cleaning approaches to remove both exact and near-duplicate documents in a two-stage process.

    File: /tf/active/vicechatdev/chromadb-cleanup/src/cleaners/combined_cleaner.py

    document-cleaning deduplication data-processing hash-based similarity-based
  • function process_inline_markdown

    Processes inline markdown formatting by unescaping HTML entities in text. Currently performs basic cleanup while preserving markdown syntax for downstream processing.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    markdown text-processing html-entities preprocessing formatting
  • function add_formatted_content_to_word_v1

    Converts processed markdown elements into formatted content within a Microsoft Word document, handling headers, paragraphs, lists, tables, and code blocks with appropriate styling.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    markdown-conversion word-document document-generation formatting docx
  • function add_formatted_content_to_pdf

    Processes markdown elements and adds them to a PDF document story with appropriate formatting, handling headers, paragraphs, lists, and tables.

    File: /tf/active/vicechatdev/vice_ai/complex_app.py

    pdf-generation reportlab markdown-to-pdf document-formatting content-rendering
  • function add_formatted_content_to_pdf_v1

    Converts processed markdown elements into formatted PDF content by adding paragraphs, headers, lists, and tables to a ReportLab story object with appropriate styling.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    pdf-generation reportlab markdown-conversion document-formatting content-rendering
  • class DataSource

    A dataclass that represents configuration for various data sources, supporting file-based, SQL database, and query-based data access patterns.

    File: /tf/active/vicechatdev/vice_ai/models.py

    dataclass configuration data-source serialization sql
  • function main_v56

    Performs comprehensive exploratory data analysis on a broiler chicken performance dataset, analyzing the correlation between Eimeria infection and performance measures (weight gain, feed conversion ratio, mortality rate) across different treatments and challenge regimens.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/343f5578-64e0-4101-84bd-5824b3c15deb/project_1/analysis.py

    data-analysis exploratory-data-analysis veterinary-research poultry-health correlation-analysis
  • function export_results

    Exports correlation analysis results to multiple CSV files, including overall correlations, grouped correlations, and significant findings.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-export csv file-io correlation-analysis results-persistence
  • function main_v26

    Orchestrates a complete correlation analysis pipeline for Eimeria infection and broiler performance data, from data loading through visualization and results export.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-analysis correlation statistics veterinary agriculture
  • function process_document

    Processes a document file (DOCX, DOC, or PDF) and extracts comprehensive metadata including file information, content metadata, and cryptographic hash.

    File: /tf/active/vicechatdev/CDocs/utils/document_processor.py

    document-processing metadata-extraction file-handling pdf docx
  • function execute_enhanced_workflow_background

    Executes an enhanced SQL workflow in a background thread, retrieving data from a database, processing it through an AI-powered workflow, and automatically triggering statistical analysis on the results.

    File: /tf/active/vicechatdev/full_smartstat/app.py

    background-processing threading sql-workflow data-retrieval ai-powered
  • function run_analysis_async

    Executes a data analysis workflow asynchronously with real-time progress tracking, including query interpretation, script generation, execution, and result finalization.

    File: /tf/active/vicechatdev/full_smartstat/app.py

    async data-analysis statistical-analysis progress-tracking llm-integration
  • class SQLQueryGenerator

    Generates SQL queries based on user requests and database schema

    File: /tf/active/vicechatdev/full_smartstat/sql_query_generator.py

    class sqlquerygenerator
  • function main_v61

    Demonstrates a SmartStat SQL workflow by loading a database schema, initializing a SQL query generator, and generating SQL queries from natural language requests with detailed output and metadata.

    File: /tf/active/vicechatdev/full_smartstat/demo_sql_workflow.py

    demonstration workflow sql-generation natural-language-processing database-schema
  • function main_v48

    Entry point function that orchestrates vendor enrichment testing by parsing command-line arguments, running setup validation, and executing a single vendor test against a ChromaDB collection.

    File: /tf/active/vicechatdev/find_email/test_enrichment.py

    testing vendor-enrichment command-line argparse chromadb
  • function test_llm_connectivity

    Tests the connectivity and functionality of an OpenAI LLM integration by analyzing a mock email with vendor information extraction.

    File: /tf/active/vicechatdev/find_email/test_vendor_extractor.py

    testing llm openai connectivity diagnostic
  • function main_v62

    Demonstrates the SmartStat SQL Workflow by loading a database schema, initializing a SQL query generator, and generating SQL queries from natural language requests for various laboratory data analysis scenarios.

    File: /tf/active/vicechatdev/smartstat/demo_sql_workflow.py

    demonstration sql-generation natural-language-processing database-schema workflow
  • function main_v54

    Performs statistical analysis on antibiotic usage data, comparing distribution patterns between vaccinated and non-vaccinated groups, and generates visualization plots, summary tables, and written conclusions.

    File: /tf/active/vicechatdev/smartstat/output/b7a013ae-a461-4aca-abae-9ed243119494/analysis_70ac0517/analysis.py

    data-analysis statistical-analysis healthcare antibiotics vaccination
  • function main_v55

    Performs statistical analysis to determine the correlation between antibiotic use frequency and vaccination modes (in-ovo vs non-in-ovo), generating visualizations and saving results to files.

    File: /tf/active/vicechatdev/smartstat/output/b7a013ae-a461-4aca-abae-9ed243119494/analysis_6cdbc6c8/analysis.py

    statistical-analysis correlation data-visualization pearson-correlation antibiotic-analysis
  • function wrap_tuple_streams

    Fills in None values in a tuple with corresponding dimensioned stream values based on matching key dimension names.

    File: /tf/active/vicechatdev/patches/util.py

    tuple-manipulation stream-processing data-substitution dimension-mapping holoviews
  • class Callable

    Callable is a wrapper class for callback functions used with DynamicMaps, providing memoization, stream management, and input/output tracking capabilities.

    File: /tf/active/vicechatdev/patches/spaces.py

    callback wrapper memoization caching dynamic-visualization
  • class InvoiceProcessor

    Main orchestrator class that coordinates the complete invoice processing pipeline from PDF extraction through validation to Excel generation.

    File: /tf/active/vicechatdev/invoice_extraction/main.py

    invoice-processing document-processing pdf-extraction entity-classification language-detection
  • function main_v4

    Main entry point function for an invoice processing system that monitors an inbound directory for PDF invoices, processes them using LLM extraction, generates Excel outputs, and moves processed files to a processed directory.

    File: /tf/active/vicechatdev/invoice_extraction/main.py

    invoice-processing pdf-processing llm-extraction batch-processing file-monitoring
  • class DocumentProcessor_v3

    A comprehensive PDF document processor that handles text extraction, OCR (Optical Character Recognition), layout analysis, table detection, and metadata extraction from PDF files.

    File: /tf/active/vicechatdev/invoice_extraction/core/document_processor.py

    pdf-processing ocr text-extraction document-processing invoice-processing

Search Examples