🔍 Code Extractor

Search Components

Full-Text: Fast keyword matching | Semantic: AI-powered understanding of intent (finds similar concepts)

Search Results for "dataset"

Found 50 matching component(s)

  • function quick_clean

    Cleans flock data by identifying and removing flocks that have treatment records with timing inconsistencies (treatments administered outside the flock's start/end date range).

    File: /tf/active/vicechatdev/quick_cleaner.py

    data-cleaning data-quality flock-management livestock poultry
  • function select_dataset

    Interactive command-line function that prompts users to select between original, cleaned, or comparison of flock datasets for analysis.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    user-interface dataset-selection interactive command-line data-loading
  • function load_analysis_data

    Loads CSV dataset(s) into pandas DataFrames based on dataset configuration, supporting both single dataset loading and comparison mode with two datasets.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    data-loading csv pandas file-io data-analysis
  • function create_data_quality_dashboard_v1

    Creates an interactive data quality dashboard for analyzing treatment timing issues in poultry flock management data by loading and processing CSV files containing timing anomalies.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    data-quality dashboard visualization poultry-management treatment-timing
  • function create_data_quality_dashboard

    Creates an interactive command-line dashboard for analyzing data quality issues in treatment timing data, specifically focusing on treatments administered outside of flock lifecycle dates.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    data-quality dashboard interactive menu-driven timing-analysis
  • function show_problematic_flocks

    Analyzes and displays problematic flocks from a dataset by identifying those with systematic timing issues in their treatment records, categorizing them by severity and volume.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    data-quality reporting diagnostics livestock-management data-validation
  • function compare_datasets

    Analyzes and compares two pandas DataFrames containing flock data (original vs cleaned), printing detailed statistics about removed records, type distributions, and impact assessment.

    File: /tf/active/vicechatdev/data_quality_dashboard.py

    data-quality comparison analysis reporting statistics
  • class DocChatRAG

    Main RAG engine with three operating modes: 1. Basic RAG (similarity search) 2. Extensive (full document retrieval with preprocessing) 3. Full Reading (process all documents)

    File: /tf/active/vicechatdev/docchat/rag_engine.py

    class docchatrag
  • function test_with_real_documents

    Tests a contract analyzer system by processing real documents from FileCloud, extracting contract information, and generating analysis reports with performance metrics.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_real_documents.py

    testing integration-test contract-analysis document-processing filecloud
  • function main_v24

    Interactive CLI function that allows users to select and run document processing test scenarios with varying document counts, providing feedback on test success and next steps.

    File: /tf/active/vicechatdev/contract_validity_analyzer/test_real_documents.py

    cli interactive testing document-processing user-input
  • function clean_collection

    Cleans a ChromaDB collection by removing duplicate and similar documents using hash-based and similarity-based deduplication techniques, then saves the cleaned data to a new collection.

    File: /tf/active/vicechatdev/chromadb-cleanup/main.py

    data-cleaning deduplication chromadb vector-database similarity-detection
  • function test_nearly_similar_text_handling

    A pytest test function that verifies the SimilarityCleaner's ability to identify and remove nearly similar text entries while preserving distinct ones.

    File: /tf/active/vicechatdev/chromadb-cleanup/tests/test_similarity_cleaner.py

    testing pytest text-processing similarity-detection deduplication
  • class AgentExecutor

    Agent-based script executor that generates standalone Python files, manages dependencies, and provides iterative debugging capabilities

    File: /tf/active/vicechatdev/vice_ai/agent_executor.py

    class agentexecutor
  • class ScriptExecutor

    A sandboxed Python script executor that safely runs user-provided Python code with timeout controls, security restrictions, and isolated execution environments for data analysis tasks.

    File: /tf/active/vicechatdev/vice_ai/script_executor.py

    sandbox script-execution security code-validation data-analysis
  • class SmartStatSession

    A session management class that encapsulates a SmartStat statistical analysis session, tracking data, analysis history, plots, and reports for a specific data section.

    File: /tf/active/vicechatdev/vice_ai/smartstat_service.py

    session-management data-analysis statistics dataframe multi-dataset
  • class SmartStatService

    Service for running SmartStat analysis sessions in Vice AI

    File: /tf/active/vicechatdev/vice_ai/smartstat_service.py

    class smartstatservice
  • function add_data_section_to_docx

    Adds a data analysis section to a Word document, including analysis metadata, statistical conclusions, and embedded visualizations from saved content or legacy analysis history.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    document-generation word-document data-analysis visualization report-generation
  • function add_data_section_to_pdf

    Adds a data analysis section to a PDF document story, including analysis metadata, statistical conclusions, and embedded visualizations from saved content or analysis history.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    pdf-generation reportlab data-analysis document-export visualization
  • function upload_data_section_dataset

    Flask API endpoint that handles CSV file uploads for data section analysis, processes the file, extracts metadata, and stores it in the data section for persistence.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    file-upload csv-processing data-analysis authentication api-endpoint
  • function smartstat_workspace

    Flask route handler that opens a SmartStat statistical analysis workspace for a specific data section, managing session creation, data restoration, and access control.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    flask-route web-handler statistical-analysis session-management data-restoration
  • function smartstat_upload_data

    Flask route handler that uploads CSV or Excel data files to a SmartStat analysis session, with support for multi-sheet Excel files and session recovery.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    flask file-upload csv excel data-analysis
  • function smartstat_select_sheet

    Flask API endpoint that processes one or more Excel sheets from an uploaded file, validates them, categorizes them as datasets or information sheets, and adds them to a SmartStat analysis session.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    excel data-loading sheet-selection api-endpoint authentication
  • function smartstat_upload_files

    Flask API endpoint that handles multi-file uploads (CSV, Excel, PDF, Word, PowerPoint) to a SmartStat session, processing data files as datasets and documents as information sheets.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    file-upload multi-file csv-processing excel-processing pdf-extraction
  • function smartstat_get_history

    Flask API endpoint that retrieves analysis history for a SmartStat session, with automatic session recovery from saved data if the session is not found in memory.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    flask api-endpoint session-management data-recovery authentication
  • function upload_analysis_dataset

    Flask API endpoint that handles file upload for data analysis sessions, accepting CSV and Excel files, validating user access, and processing the dataset through a data analysis service.

    File: /tf/active/vicechatdev/vice_ai/new_app.py

    flask file-upload data-analysis api-endpoint authentication
  • class Config

    Configuration class that manages application-wide settings, directory structures, API keys, and operational parameters for a statistical analysis application.

    File: /tf/active/vicechatdev/vice_ai/smartstat_config.py

    configuration settings flask api-keys directory-management
  • class DataAnalysisService

    Service class for managing data analysis operations within document sections, integrating with SmartStat components for statistical analysis, dataset processing, and visualization generation.

    File: /tf/active/vicechatdev/vice_ai/data_analysis_service.py

    data-analysis statistical-analysis session-management dataset-processing visualization
  • function create_test_dataset

    Creates a test CSV dataset with sample product sales data across different regions and months, saving it to a temporary file.

    File: /tf/active/vicechatdev/vice_ai/test_integration.py

    testing test-data csv dataset-generation synthetic-data
  • function main_v25

    Orchestrates and executes a comprehensive test suite for the Vice AI Data Analysis Integration, running multiple test functions, creating test datasets, and providing detailed pass/fail reporting.

    File: /tf/active/vicechatdev/vice_ai/test_integration.py

    testing integration-tests test-runner data-analysis test-orchestration
  • class DataSection

    A dataclass representing a dedicated data analysis section that stores analysis results, plots, dataset information, and conclusions separately from text content.

    File: /tf/active/vicechatdev/vice_ai/models.py

    dataclass data-analysis serialization versioning content-management
  • class DatabaseManager_v1

    SQLite database manager for persistent storage

    File: /tf/active/vicechatdev/vice_ai/models.py

    class databasemanager
  • class StatisticalAgent

    LLM-powered statistical analysis agent

    File: /tf/active/vicechatdev/vice_ai/statistical_agent.py

    class statisticalagent
  • function remove_outliers_iqr

    Removes outliers from a pandas DataFrame column using the Interquartile Range (IQR) method with a conservative 3*IQR threshold.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/42b81361-ba7e-4d79-9598-3090af68384b/analysis_2.py

    data-cleaning outlier-detection IQR interquartile-range data-preprocessing
  • function remove_outliers

    Removes outliers from a pandas DataFrame based on the Interquartile Range (IQR) method for a specified column.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/f5da873e-41e6-4f34-b3e4-f7443d4d213b/analysis_5.py

    data-cleaning outlier-detection IQR interquartile-range data-preprocessing
  • function calculate_cv

    Calculates the coefficient of variation (CV) for a dataset, expressed as a percentage of the standard deviation relative to the mean.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/d48d7789-9627-4e96-9f48-f90b687cd07d/analysis_1.py

    statistics coefficient-of-variation data-analysis variability dispersion
  • function load_dataset

    Loads a CSV dataset from a specified file path using pandas and returns it as a DataFrame with error handling for file not found and general exceptions.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/e1ecec5f-4ea5-49c5-b4f5-d051ce851294/project_1/analysis.py

    data-loading csv pandas file-io error-handling
  • function perform_analysis

    Performs comprehensive statistical analysis on grouped biological/experimental data, including descriptive statistics, correlation analysis, ANOVA testing, and visualization of infection levels and growth performance across different groups.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/e1ecec5f-4ea5-49c5-b4f5-d051ce851294/project_1/analysis.py

    statistical-analysis data-analysis ANOVA correlation visualization
  • function calculate_cv_v1

    Calculates the Coefficient of Variation (CV) for a dataset, expressed as a percentage. CV measures relative variability by dividing standard deviation by mean.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/d1e252f5-950c-4ad7-b425-86b4b02c3c62/analysis_4.py

    statistics coefficient-of-variation variability dispersion data-analysis
  • function main_v57

    Performs comprehensive exploratory data analysis on a broiler chicken performance dataset, analyzing the correlation between Eimeria infection and performance measures (weight gain, feed conversion ratio, mortality rate) across different treatments and challenge regimens.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/343f5578-64e0-4101-84bd-5824b3c15deb/project_1/analysis.py

    data-analysis exploratory-data-analysis veterinary-research poultry-health correlation-analysis
  • function load_data

    Loads a CSV dataset from a specified filepath using pandas, with fallback to creating sample data if the file is not found.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-loading csv pandas file-io error-handling
  • function create_sample_data_v2

    Generates a synthetic dataset of 200 poultry research records with multiple treatment groups, challenge regimens, and performance metrics for demonstration purposes.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-generation synthetic-data poultry-research veterinary coccidiosis
  • function explore_data

    Performs comprehensive exploratory data analysis on a pandas DataFrame, printing dataset overview, data types, missing values, descriptive statistics, and identifying categorical and numerical variables.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-exploration EDA exploratory-data-analysis data-profiling pandas
  • function identify_variables

    Categorizes DataFrame columns into Eimeria infection variables, performance measure variables, and grouping variables based on keyword matching in column names.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-preprocessing variable-classification keyword-matching veterinary-research eimeria
  • function calculate_correlations

    Calculates both Pearson and Spearman correlation coefficients between Eimeria variables and performance variables, filtering out missing values and identifying statistically significant relationships.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    correlation statistics data-analysis pearson spearman
  • function main_v26

    Orchestrates a complete correlation analysis pipeline for Eimeria infection and broiler performance data, from data loading through visualization and results export.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/5a059cb7-3903-4020-8519-14198d1f39c9/analysis_1.py

    data-analysis correlation statistics veterinary agriculture
  • function detect_outliers_iqr_v1

    Detects outliers in a dataset using the Interquartile Range (IQR) method, returning boolean indices of outliers and the calculated bounds.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/328d2f87-3367-495e-89f7-e633ff8c5b3d/analysis_2.py

    outlier-detection IQR interquartile-range statistics data-cleaning
  • function detect_outliers_zscore

    Detects outliers in numerical data using the Z-score statistical method, identifying data points that deviate significantly from the mean.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/328d2f87-3367-495e-89f7-e633ff8c5b3d/analysis_2.py

    outlier-detection statistics data-cleaning anomaly-detection z-score
  • function detect_outliers_iqr_v2

    Detects statistical outliers in a dataset using the Interquartile Range (IQR) method with a conservative 3×IQR threshold.

    File: /tf/active/vicechatdev/vice_ai/smartstat_scripts/84b9ac09-e646-4422-9d3a-e9f96529a553/analysis_1.py

    outlier-detection statistics data-cleaning IQR interquartile-range
  • class ScriptExecutor_v1

    A sandboxed Python script executor that safely runs user-provided Python code with timeout controls, security restrictions, and isolated execution environments for data analysis tasks.

    File: /tf/active/vicechatdev/full_smartstat/script_executor.py

    sandbox security script-execution code-execution timeout
  • function execute_enhanced_workflow_background

    Executes an enhanced SQL workflow in a background thread, retrieving data from a database, processing it through an AI-powered workflow, and automatically triggering statistical analysis on the results.

    File: /tf/active/vicechatdev/full_smartstat/app.py

    background-processing threading sql-workflow data-retrieval ai-powered

Search Examples