Open links in new tab
  1. YData data quality for Data Science | Synthetic data Data-Centric AI

    Generate synthetic data, manage data, improve data quality, and build the best datasets for your AI projects with the YData Fabric platform.

  2. Welcome - YData Profiling

    YData Profiling simplifies data understanding and preparation by automating detailed reports, statistics, and visualizations in a single line of code.

  3. YData Fabric Platform

    Get easy access to data, improve data quality, and create synthetic data with the YData Fabric platform.

  4. YData SDK

    YData SDK is the leading Python package for Data & AI, providing an ecosystem of methods that enables data professionals to adopt a data-centric development approach focused on improving data …

  5. YData SDK | Data profiling

    Exploratory data analysis and data profiling are foundational steps in data governance, data management and for a successful adoption of AI.

  6. YData Python Package

    YData SDK The python package for 'all things data' The fastest path to deliver high-quality data. Automated data profiling and synthetic data in a user friendly python package that unlocks …

  7. Best practices - YData SDK

    Best practices for optimal synthetic data generation Overview This document outlines the best practices for generating structured synthetic data, focusing on ensuring data quality, privacy, and utility. …

  8. YData Profiling: The debut of Pandas Profiling in the Big Data world

    Feb 1, 2023 · Discover ydata-profiling, the open-source data profiling package with Spark DataFrame support. Transform big data into smart data with profiling at scale.

  9. Examples - YData Profiling

    Examples The following example reports showcase the potentialities of the package across a wide range of dataset and data types: Census Income (US Adult Census data relating income with other …

  10. YData | Data profiling in a Catalog

    Exploratory data analysis and data profiling are foundational steps in a successful adoption of AI. The data catalog provides management and scale.