
How to generate synthetic data for CSV files to test data pipelines
Using Neosync to generate CSV files with synthetic data for testing your data pipelines is easy. Here's how.
March 3rd, 2025
We're back with another set of brand new features from June! Here's what's new in Neosync:
Neosync now supports integrating with MongoDB! With this integration, we now have SQL and NoSQL support across the most popular databases.
We now validate your transformer mappings in real time against your schema to ensure that your mappings are correct and contain the necessary references.
We've added support for Google Cloud Storage as a destination connection that you can use to sync your data.
You can now clone an entire job with just one click! This is really useful for testing or creating new jobs from existing ones without having to reconfigure everything from scratch.
We've overhauled the synthetic data generator for OpenAI and other LLMs to be way more stable and performant. We've also added the ability to configure the batch size.
Subsetting queries now have auto-complete for columns so you no longer have to toggle back and forth between your database and the Neosync app. Your columns are just there!
You can now get a row count for your subset query to validate if it's working correctly!
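To picture what a subset row count checks, here is a minimal sketch that wraps a subset filter in a `COUNT(*)` query. It uses Python's built-in `sqlite3` module with a made-up `users` table, and `subset_row_count` and `build_demo_db` are hypothetical helpers for illustration only. This is not how Neosync implements the feature, just a way to reproduce the same sanity check by hand.

```python
import sqlite3


def subset_row_count(conn: sqlite3.Connection, table: str, where_clause: str) -> int:
    """Return how many rows of `table` match `where_clause`.

    Hypothetical helper for illustration only. `table` and `where_clause`
    are interpolated directly into the SQL, so only pass trusted input.
    """
    query = f'SELECT COUNT(*) FROM "{table}" WHERE {where_clause}'
    (count,) = conn.execute(query).fetchone()
    return count


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a small, made-up `users` table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE users (id INTEGER PRIMARY KEY, country TEXT, age INTEGER)"
    )
    conn.executemany(
        "INSERT INTO users (country, age) VALUES (?, ?)",
        [("US", 34), ("US", 19), ("DE", 42), ("FR", 27)],
    )
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    # Count the rows a subset filter would keep before running a full sync.
    print(subset_row_count(conn, "users", "country = 'US' AND age > 21"))
```

If the count comes back as zero or as the full table size, that is usually a sign the filter is not doing what you intended.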
Here are some news items and articles that we found interesting this month:
Fun fact: Notion only used a single Postgres instance for YEARS. Here's the blog if you're curious.
Some helpful links to have handy:
Thanks for reading and see you in July!
Nucleus Cloud Corp. 2025