🔍 Code Extractor

Search Results for "encoding"

Found 50 matching components

  • function test_template_with_data

    Tests a template file by replacing placeholders with test data and validates that all required placeholders have been filled, excluding known conditional placeholders.

    File: /tf/active/vicechatdev/test_comprehensive_templates.py

    template-testing validation placeholder-replacement jinja2 file-processing
  • function clean_text_for_xml

    Sanitizes text by removing or replacing XML-incompatible characters to ensure compatibility with Word document XML structure.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    text-processing xml sanitization word-documents character-encoding
  • function create_csv_report_improved

    Creates two CSV reports from warranty data: a summary report with key fields and a detailed report with all fields including full disclosures.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    csv report-generation file-io warranty data-export
  • function main_v10

    Orchestrates the conversion of an improved markdown file containing warranty disclosures into multiple tabular formats (CSV, Excel, Word) with timestamp-based file naming.

    File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

    file-conversion markdown-processing warranty-data csv-export excel-export
  • class RegulatoryExtractor

    A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

    File: /tf/active/vicechatdev/reg_extractor.py

    pdf-extraction regulatory-documents llm-extraction ocr data-extraction
  • function test_complex_url_hyperlink

    A test function that validates the creation of Word documents with complex FileCloud URLs containing special characters, query parameters, and URL fragments as clickable hyperlinks.

    File: /tf/active/vicechatdev/test_complex_hyperlink.py

    testing word-document hyperlink docx url-handling
  • function test_markdown_link_parsing

    A test function that validates markdown link parsing capabilities, specifically testing extraction and URL encoding of complex URLs containing special characters from Quill editor format.

    File: /tf/active/vicechatdev/test_complex_hyperlink.py

    testing markdown url-parsing regex url-encoding
  • function create_csv_report

    Creates two CSV reports (summary and detailed) from warranty data, writing warranty information to files with different levels of detail.

    File: /tf/active/vicechatdev/convert_disclosures_to_table.py

    csv reporting data-export file-io warranty
  • function main_v17

    Converts a markdown file containing warranty disclosure data into multiple tabular formats (CSV, Excel, Word) with timestamped output files.

    File: /tf/active/vicechatdev/convert_disclosures_to_table.py

    markdown-conversion data-extraction report-generation csv-export excel-export
  • function main_v20

    Main entry point function that reads a markdown file, converts it to an enhanced Word document with preserved heading structure, and saves it with a timestamped filename.

    File: /tf/active/vicechatdev/improved_word_converter.py

    document-conversion markdown-to-word file-processing docx main-entry-point
  • function main_v1

    Main orchestration function that reads an improved markdown file and converts it to an enhanced Word document with comprehensive formatting, including table of contents, warranty sections, disclosures, and bibliography.

    File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

    document-generation word-processing markdown-conversion docx file-processing
  • class MyEmbeddingFunction_v1

    A custom embedding function class that generates embeddings for documents using OpenAI's API, with built-in text summarization for long documents and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    embeddings openai chromadb vector-database text-summarization
  • class OneCo_hybrid_RAG

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

    class oneco_hybrid_rag
  • class FixedProjectVictoriaGenerator

    Fixed Project Victoria Disclosure Generator that properly handles all warranty sections.

    File: /tf/active/vicechatdev/fixed_project_victoria_generator.py

    class fixedprojectvictoriagenerator
  • class OneCo_hybrid_RAG_v1

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG_old.py

    class oneco_hybrid_rag
  • class MeetingMinutesGenerator

    A class that generates professional meeting minutes from meeting transcripts using OpenAI's GPT-4o model, with capabilities to parse metadata, extract action items, and format output.

    File: /tf/active/vicechatdev/meeting_minutes_generator.py

    meeting-minutes transcript-processing openai gpt-4o natural-language-processing
  • class MyEmbeddingFunction_v2

    A custom embedding function class that generates embeddings for text documents using OpenAI's embedding models, with automatic text summarization and token management for large documents.

    File: /tf/active/vicechatdev/offline_docstore_multi_vice.py

    embeddings openai chromadb text-processing summarization
  • class DocumentProcessor_v5

    Processes different document types for RAG context extraction.

    File: /tf/active/vicechatdev/offline_docstore_multi_vice.py

    class documentprocessor
  • class ImprovedProjectVictoriaGenerator

    Improved Project Victoria Disclosure Generator with proper reference management.

    File: /tf/active/vicechatdev/improved_project_victoria_generator.py

    class improvedprojectvictoriagenerator
  • class MeetingMinutesGenerator_v1

    A class that generates professional meeting minutes from meeting transcripts using either OpenAI's GPT-4o or Google's Gemini AI models.

    File: /tf/active/vicechatdev/advanced_meeting_minutes_generator.py

    meeting-minutes transcript-processing llm gpt-4o gemini
  • class QueryBasedExtractor_v2

    A class that performs targeted information extraction from text using LLM-based query-guided extraction, with support for handling long documents through chunking and token management.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    information-extraction text-processing llm openai query-based
  • class OneCo_hybrid_RAG_v2

    A class named OneCo_hybrid_RAG

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class oneco_hybrid_rag
  • class ExtensiveSearchManager

    Manages extensive search functionality including full document retrieval, summarization, and enhanced context gathering.

    File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

    class extensivesearchmanager
  • function msg_to_eml_alternative

    Converts Microsoft Outlook .msg files to .eml (email) format using the extract_msg library, preserving email headers, body content (plain text and HTML), and attachments.

    File: /tf/active/vicechatdev/msg_to_eml.py

    email-conversion msg-to-eml outlook mime email-processing
  • function msg_to_pdf

    Converts a Microsoft Outlook .msg email file to a single PDF document, including the email body and all attachments merged together.

    File: /tf/active/vicechatdev/msg_to_eml.py

    email-conversion pdf-generation msg-file outlook document-processing
  • function generate_simple_html_from_eml

    Converts an email.message.Message object into a clean, styled HTML representation with embedded inline images and attachment listings.

    File: /tf/active/vicechatdev/msg_to_eml.py

    email html-generation email-parsing mime inline-images
  • function eml_to_pdf

    Converts an .eml email file to PDF format, including the email body and all attachments merged into a single PDF document.

    File: /tf/active/vicechatdev/msg_to_eml.py

    email-processing pdf-conversion eml-parser document-conversion attachment-handling
  • class DocxMerger

    A class named DocxMerger

    File: /tf/active/vicechatdev/word_merge.py

    class docxmerger
  • class ProjectVictoriaDisclosureGenerator

    Main class for generating Project Victoria disclosures from warranty claims.

    File: /tf/active/vicechatdev/project_victoria_disclosure_generator.py

    class projectvictoriadisclosuregenerator
  • class MyEmbeddingFunction_v3

    A custom embedding function class that generates embeddings for text documents using OpenAI's embedding models, with automatic text summarization and token limit handling for large documents.

    File: /tf/active/vicechatdev/offline_docstore_multi.py

    embeddings openai vector-database chromadb text-processing
  • class DocumentProcessor_v6

    Processes different document types for RAG context extraction.

    File: /tf/active/vicechatdev/offline_docstore_multi.py

    class documentprocessor
  • class DocumentExtractor

    A document text extraction class that supports multiple file formats including Word, PowerPoint, PDF, and plain text files, with automatic format detection and conversion capabilities.

    File: /tf/active/vicechatdev/leexi/document_extractor.py

    document-processing text-extraction pdf word powerpoint
  • function extract_previous_reports_summary

    Extracts and summarizes key information from previous meeting report files using document extraction and OpenAI's GPT-4o-mini model to provide context for upcoming meetings.

    File: /tf/active/vicechatdev/leexi/app.py

    meeting-analysis document-extraction text-summarization llm openai
  • function generate_minutes

    Flask route handler that processes uploaded meeting transcripts and optional supporting documents to generate structured meeting minutes using AI, with configurable output styles and validation.

    File: /tf/active/vicechatdev/leexi/app.py

    flask web-api file-upload meeting-minutes ai-generation
  • function regenerate_minutes

    Flask route handler that regenerates meeting minutes from a previous session using modified instructions, model selection, and configuration parameters.

    File: /tf/active/vicechatdev/leexi/app.py

    flask meeting-minutes regeneration ai-generation openai
  • function test_attendee_extraction_comprehensive

    A comprehensive test function that validates the attendee extraction logic from meeting transcripts, comparing actual speakers versus mentioned names, and demonstrating integration with meeting minutes generation.

    File: /tf/active/vicechatdev/leexi/test_attendee_comprehensive.py

    testing attendee-extraction meeting-minutes transcript-parsing speaker-identification
  • class EnhancedMeetingMinutesGenerator

    A class named EnhancedMeetingMinutesGenerator

    File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

    class enhancedmeetingminutesgenerator
  • function main_v2

    Command-line interface function that orchestrates the generation of enhanced meeting minutes from transcript files and PowerPoint presentations using various LLM models (GPT-4o, Azure GPT-4o, or Gemini).

    File: /tf/active/vicechatdev/leexi/enhanced_meeting_minutes_generator.py

    cli command-line meeting-minutes llm gpt-4
  • function test_attendee_extraction

    A test function that validates the attendee extraction logic of the EnhancedMeetingMinutesGenerator by parsing a meeting transcript and displaying extracted metadata including speakers, date, and duration.

    File: /tf/active/vicechatdev/leexi/test_attendee_extraction.py

    testing unit-test meeting-minutes attendee-extraction metadata-parsing
  • function create_test_file_v1

    Creates a temporary test file with specified content and filename in a temporary directory.

    File: /tf/active/vicechatdev/SPFCsync/test_upload_modalities.py

    testing file-creation temporary-files test-utilities file-operations
  • function save_custom_instructions

    Flask API endpoint that saves custom instructions to a markdown file in a designated instructions directory with filename sanitization.

    File: /tf/active/vicechatdev/docchat/app.py

    flask api-endpoint file-operations rest-api file-saving
  • function load_custom_instructions

    Flask API endpoint that loads and returns the content of custom instruction files from a designated instructions directory with security validation.

    File: /tf/active/vicechatdev/docchat/app.py

    flask api-endpoint file-loading security path-traversal-prevention
  • class QueryBasedExtractor

    A class that extracts relevant information from documents using a small LLM, designed for the Extensive and Full Reading modes in RAG systems.

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    information-extraction document-processing llm rag query-based
  • class DocChatRAG

    Main RAG engine with three operating modes: (1) Basic RAG (similarity search), (2) Extensive (full document retrieval with preprocessing), and (3) Full Reading (process all documents).

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    class docchatrag
  • class DocChatEmbeddingFunction

    A custom ChromaDB embedding function that generates OpenAI embeddings with automatic text summarization for documents exceeding token limits.

    File: /tf/active/vicechatdev/docchat/document_indexer.py

    embeddings chromadb openai text-processing summarization
  • class DocumentProcessor_v8

    Processes different document types for indexing.

    File: /tf/active/vicechatdev/docchat/document_indexer.py

    class documentprocessor
  • class GPT5Validator

    A comprehensive testing and validation class for OpenAI GPT models, with special support for GPT-5 family models using the Responses API.

    File: /tf/active/vicechatdev/docchat/test_gpt5_readiness.py

    testing validation openai gpt-5 api-testing
  • function test_local_document

    Integration test function that validates end date extraction from a local PDF document using document processing and LLM-based analysis.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_local_document.py

    testing integration-test document-processing pdf-extraction llm
  • class ContractDataExtractor

    Extracts structured data from legal contracts using LLM analysis.

    File: /tf/active/vicechatdev/contract_validity_analyzer/extractor.py

    class contractdataextractor
  • function test_with_real_documents

    Tests a contract analyzer system by processing real documents from FileCloud, extracting contract information, and generating analysis reports with performance metrics.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_real_documents.py

    testing integration-test contract-analysis document-processing filecloud
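
The clean_text_for_xml result above describes removing or replacing XML-incompatible characters before text is written into a Word document. The listing does not show that function's source, so the following is only a minimal sketch of the general idea, not the actual implementation: it assumes the goal is to drop every code point outside the XML 1.0 character range. The name strip_invalid_xml_chars and its replacement parameter are hypothetical.

```python
import re

# Matches any code point NOT allowed by the XML 1.0 "Char" production:
# tab, LF, CR, U+0020-U+D7FF, U+E000-U+FFFD, U+10000-U+10FFFF.
_INVALID_XML_CHARS = re.compile(
    "[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml_chars(text: str, replacement: str = "") -> str:
    """Hypothetical helper: remove (or replace) characters XML 1.0 forbids."""
    return _INVALID_XML_CHARS.sub(replacement, text)


if __name__ == "__main__":
    raw = "Clause 4\x00: see annex\x0b A\tfor details"
    # NUL and vertical tab are dropped; the ordinary tab is kept.
    print(repr(strip_invalid_xml_chars(raw)))
```

Passing a visible replacement such as "?" instead of the empty string makes it easier to spot where characters were dropped when reviewing the generated document.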
