Search - Code Extractor

function extract_warranty_data_improved

Parses markdown-formatted warranty documentation to extract structured warranty data including IDs, titles, sections, disclosure text, and reference citations.

File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

markdown-parsing text-extraction warranty-processing document-parsing regex

function parse_references_section

Parses a formatted references section string and extracts structured data including reference numbers, sources, and content previews using regular expressions.

File: /tf/active/vicechatdev/improved_convert_disclosures_to_table.py

parsing text-processing references citations regex

function get_bibtext

Retrieves and parses BibTeX citation data for a given DOI (Digital Object Identifier), extracting the title and formatted BibTeX string.

File: /tf/active/vicechatdev/offline_parser_docstore.py

bibliography bibtex doi citation academic

function create_folder_hierarchy_v2

Creates a hierarchical structure of Subfolder nodes in a Neo4j graph database based on a file path, establishing parent-child relationships between folders.

File: /tf/active/vicechatdev/offline_parser_docstore.py

neo4j graph-database hierarchy folder-structure file-system

class RegulatoryExtractor

A class for extracting structured metadata from regulatory guideline PDF documents using LLM-based analysis and storing the results in an Excel tracking spreadsheet.

File: /tf/active/vicechatdev/reg_extractor.py

pdf-extraction regulatory-documents llm-extraction ocr data-extraction

function test_markdown_link_parsing

A test function that validates markdown link parsing, specifically the extraction and URL encoding of complex URLs that contain special characters and originate from Quill editor format.

File: /tf/active/vicechatdev/test_complex_hyperlink.py

testing markdown url-parsing regex url-encoding
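
The kind of check described above could look roughly like the following sketch. It is not the code from test_complex_hyperlink.py: the function name `extract_links`, the regular expression, and the choice of characters left unescaped are all assumptions made for illustration, and the pattern assumes the URL itself contains no closing parenthesis.

```python
import re
from urllib.parse import quote

# Hypothetical pattern: [label](url), where the URL may contain spaces
# but is assumed not to contain a closing parenthesis.
LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)]+)\)")

# Characters kept as-is so the URL structure (scheme, path, query) survives.
# "%" is included so already-encoded sequences are not encoded twice.
SAFE_CHARS = ":/?#[]@!$&'()*+,;=%"


def extract_links(markdown: str):
    """Return (label, percent-encoded URL) pairs for each markdown link."""
    return [
        (label, quote(url.strip(), safe=SAFE_CHARS))
        for label, url in LINK_RE.findall(markdown)
    ]


if __name__ == "__main__":
    sample = "See [report](https://example.com/my file.pdf?q=a b) now"
    print(extract_links(sample))
```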

function extract_warranty_data

Parses markdown-formatted warranty documentation to extract structured warranty information including IDs, titles, sections, source document counts, warranty text, and disclosure content.

File: /tf/active/vicechatdev/convert_disclosures_to_table.py

markdown-parsing data-extraction warranty-processing text-processing regex

function create_word_report

Generates a formatted Microsoft Word document report containing warranty disclosures with a table of contents, metadata, and structured sections for each warranty.

File: /tf/active/vicechatdev/convert_disclosures_to_table.py

document-generation word-document docx report-generation warranty

function clean_text_for_xml_v1

Sanitizes text strings to ensure XML 1.0 compatibility by removing or replacing invalid control characters and ensuring all characters meet XML specification requirements for Word document generation.

File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

text-processing xml sanitization data-cleaning word-documents
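
As an illustration of the sanitization described above, here is a minimal sketch that keeps only characters allowed by the XML 1.0 `Char` production (tab, line feed, carriage return, and the ranges U+0020 to U+D7FF, U+E000 to U+FFFD, U+10000 to U+10FFFF). The function name `strip_invalid_xml_chars` and the `replacement` parameter are hypothetical; the actual `clean_text_for_xml_v1` may behave differently.

```python
import re

# Matches any character outside the XML 1.0 "Char" production.
_INVALID_XML_CHARS = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml_chars(text: str, replacement: str = "") -> str:
    """Replace characters that XML 1.0 forbids (hypothetical helper)."""
    return _INVALID_XML_CHARS.sub(replacement, text)


if __name__ == "__main__":
    # NUL and vertical tab are dropped; tab and newline are kept.
    print(repr(strip_invalid_xml_chars("a\x00b\x0bc\td\n")))
```

Removing rather than escaping is one possible design choice here: control characters such as NUL cannot be represented in XML 1.0 even as character references, so they have to be dropped or substituted before the text reaches a Word document.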

function extract_warranty_sections

Parses markdown content to extract warranty section headers, returning a list of dictionaries containing section IDs and titles for table of contents generation.

File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

markdown-parsing text-processing warranty-documents table-of-contents document-structure

function extract_total_references

Extracts the total count of references from markdown-formatted content by first checking for a header line with the total, then falling back to manually counting reference entries.

File: /tf/active/vicechatdev/enhanced_word_converter_fixed.py

markdown parsing text-processing references bibliography
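
The header-first, count-as-fallback strategy described above can be sketched as follows. The header wording ("Total References: N", optionally bold) and the entry format (lines starting with "[n]") are assumptions for illustration, as is the function name `count_references`; the real markdown layout is not shown in this listing.

```python
import re

# Assumed header format, e.g. "**Total References:** 12" (hypothetical).
HEADER_RE = re.compile(
    r"^[*# \t]*Total References[:* \t]*(\d+)", re.IGNORECASE | re.MULTILINE
)
# Assumed entry format: a line that starts with "[1]", "[2]", ... (hypothetical).
ENTRY_RE = re.compile(r"^[ \t]*\[\d+\]", re.MULTILINE)


def count_references(content: str) -> int:
    """Prefer the stated total; otherwise count reference entries."""
    header = HEADER_RE.search(content)
    if header:
        return int(header.group(1))
    return len(ENTRY_RE.findall(content))


if __name__ == "__main__":
    print(count_references("**Total References:** 7\n[1] A\n[2] B\n"))
    print(count_references("[1] A\n[2] B\n[3] C\n"))
```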

class OneCo_hybrid_RAG

A class named OneCo_hybrid_RAG

File: /tf/active/vicechatdev/OneCo_hybrid_RAG copy.py

class oneco_hybrid_rag

function create_document_version_v2

Creates a new version of an existing document in a document management system, storing the file in FileCloud and tracking version metadata in Neo4j graph database.

File: /tf/active/vicechatdev/document_controller_backup.py

document-management version-control filecloud neo4j graph-database

class FileCloudAPI

Python wrapper for the FileCloud REST API. This class provides methods to interact with FileCloud server APIs, handling authentication, session management, and various file operations.

File: /tf/active/vicechatdev/FC_api copy.py

class filecloudapi

class OneCo_hybrid_RAG_v1

A class named OneCo_hybrid_RAG

File: /tf/active/vicechatdev/OneCo_hybrid_RAG_old.py

class oneco_hybrid_rag

class FileCloudAPI_v1

Python wrapper for the FileCloud REST API. This class provides methods to interact with FileCloud server APIs, handling authentication, session management, and various file operations.

File: /tf/active/vicechatdev/FC_api.py

class filecloudapi

function create_folder_hierarchy

Creates a hierarchical structure of Subfolder nodes in a Neo4j graph database based on a file system path, connecting each folder level with PATH relationships.

File: /tf/active/vicechatdev/offline_docstore_multi_vice.py

neo4j graph-database file-system hierarchy folder-structure
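
To make the folder-hierarchy idea concrete, the sketch below derives the parent-child folder pairs that such a function would need to link, without touching a database. The helper name `folder_edges` is hypothetical, and the Neo4j writes (creating Subfolder nodes and PATH relationships) are deliberately left out, since the actual queries are not shown in this listing.

```python
from pathlib import PurePosixPath
from typing import List, Tuple


def folder_edges(file_path: str) -> List[Tuple[str, str]]:
    """Return (parent_folder, child_folder) pairs for each level above the file."""
    directory = PurePosixPath(file_path).parent
    # parents is ordered nearest-first, so reverse it to walk from the top down.
    # Entries with an empty name ("/" or ".") are skipped as they are not folders.
    ancestors = [p for p in list(directory.parents)[::-1] if p.name]
    chain = ancestors + [directory]
    return [(str(parent), str(child)) for parent, child in zip(chain, chain[1:])]


if __name__ == "__main__":
    for parent, child in folder_edges("/docs/reports/2024/q1.pdf"):
        # Each pair would correspond to one relationship between two folder nodes.
        print(parent, "->", child)
```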

function main_v24

Command-line interface function that orchestrates the generation of meeting minutes from a transcript file using either the GPT-4o or the Gemini LLM.

File: /tf/active/vicechatdev/advanced_meeting_minutes_generator.py

cli command-line meeting-minutes transcript-processing llm

class OneCo_hybrid_RAG_v2

A class named OneCo_hybrid_RAG

File: /tf/active/vicechatdev/OneCo_hybrid_RAG.py

class oneco_hybrid_rag

function parse_email_address

Parses email address strings by handling multiple addresses separated by semicolons and converting them to comma-separated format.
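
A minimal sketch of the semicolon-to-comma conversion described above, assuming whitespace around each address should be trimmed and empty entries dropped. The function name `semicolons_to_commas` is hypothetical, and the real `parse_email_address` may handle additional cases.

```python
def semicolons_to_commas(addresses: str) -> str:
    """Convert 'a@x.com; b@y.org' into 'a@x.com, b@y.org'."""
    parts = [part.strip() for part in addresses.split(";")]
    # Skip empty pieces produced by trailing or doubled semicolons.
    return ", ".join(part for part in parts if part)


if __name__ == "__main__":
    print(semicolons_to_commas("a@x.com; b@y.org;"))
```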