Example with CSV

Click on "New Job," and a new page will open.

  • Choose a job name for your data synthesis project.

  • Select your desired locale for the synthetic data.

  • Select Source Type: In this demo, we are working with a CSV file as the source of our data. Specify this by selecting "CSV file" as the source type.

  • Upload CSV File: Upload the CSV file containing the data that you want to synthesize. This is the actual data source from which synthetic data will be generated. The file should contain the data you wish to work with in your data synthesis process.

After the file is uploaded, the Application will analyze the columns in the table.

It will also detect any personally identifiable information (PII) and suggest synthesis strategies

Assign Semantics and Patterns

You will need to select elements from the uploaded data that you want to synthesize.

  • For each selected element, you must assign semantics and patterns (data format).

  • The Application automatically provides suggestions for semantics and patterns based on the data provided (e.g., name, address, email, DOB).

  • You can modify the suggested semantics according to your specific needs.

Save Element Configurations

You can select as many elements as you want to be synthesized and save each element individually after assigning semantics and patterns.

Confirm Synthesis Strategy

  • After all the elements are assigned semantics and patterns, click "Next."

  • The website will showcase all the selected fields on the left side and ask you to confirm the synthesis strategy.

Submit Job

  • Once you are satisfied with the selected fields and synthesis strategy, click "Submit Job."

  • The data synthesis process will be initiated, and the job will be added to the queue for synthesis.

Monitor Job Status

  • You can monitor the status of your job. Initially, it will be "In Progress."

  • Once the job is completed, the status will change to "Finished."

Job Completion

  • Once the data synthesis process is completed, the status of your job will change to "Finished." This indicates that the system has successfully generated synthetic data based on your defined parameters.

  • After the status has been changed to "Finished," you can access detailed information about the job. To do this, click on "Job Details."

View Job Details

Clicking on "Job Details" will open a page displaying comprehensive information about your job, including all the settings and configurations you defined.

Download Synthesized Data

Within the job details page, you will find a "Download File" button. Clicking on this button will allow you to download the synthesized version of the original CSV data that you uploaded. This synthesized data is now ready for use in your projects, without exposing sensitive or real information.

Access Data Reports

Towards the end of the page, you will find a "Report" button. Clicking on this button will provide access to a range of synthesized data reports:

  • Accuracy Report: This report may include visual representations or graphs that assess the accuracy of the synthesized data in comparison to the original data.

  • Overall Accuracy: This section offers insights into the overall accuracy of the synthesized data, helping you evaluate the quality and reliability of the generated data.

  • Correlation Matrices Report: Correlation matrices are generated to show the relationships and dependencies between different data attributes in the synthesized data.

  • Pair Plot Report: Pair plots provide visual representations of relationships between pairs of data attributes, offering insights into data patterns and correlations.

  • Distribution Report: The distribution report offers graphical representations of the distribution of data attributes in the synthesized data, which can be useful for data analysis and understanding the synthetic data's characteristics.

By following these steps and accessing the provided reports, you can thoroughly assess the quality and characteristics of the synthesized data, ensuring it meets your requirements and objectives.

Last updated