The Open SourceData Generator
Synth is an open source data-as-code tool that provides a simple CLI workflow for generating consistent data in a scalable way.
Synth encodes dataset constraints, relationships and business logic into declarative configuration files.
Backed by
YCombinator
What you can do with Synth
Anonymize
Anonymize sensitive production data.
Use Synth to generate correct, anonymized data that looks and quacks like production.
Seed
Seed development and environments and CI
Generate test data fixtures for your development, testing and continuous integration.
Synthesize
Create realistic data to your specifications
Generate data that tells the story you want to tell. Specify constraints, relations and all your semantics.
A versatile API for all types of data
Generate a stock prices time series
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
{
  "type": "object",
  "timestamp": {
    "type": "string",
    "date_time": {
      "format": "%Y-%m-%dT%H:%M:%S",
      "subtype": "naive_date_time",
      "begin": "2020-06-07T12:00:00"
    }
  },
  "px_last": {
    "type": "number",
    "range": {
      "low": 60,
      "high": 80,
      "step": 0.1
    }
  },
  "volume": {
    "type": "number",
    "range": {
      "low": 1000,
      "high": 10000,
      "step": 1
    }
  }
}
Time series data
Relational data
Event logs data
Synth helps you write better software, faster. Join the community!
discord