Connect to any steps in the data pipeline like source system, raw data source (S3, Blob, GCS), DW (BigQuery, Snowflake, Redshift ), or streaming source like Kafka/pub-sub. Telmai works best for your spark based pipelines :-)
Telmai will automatically learn all about your data-set like its schema, volume, value distributions, completeness/uniqueness of values, expected ranges, expected values, etc., and present it to users. Users can then identify outliers and provide input to our system using our Human in the loop approach.
Telmai will automatically startmonitoring the incoming data for any drifts in the data metrics over time-time. Users will automatically get alerted if there is an unexpected change in data.
Data owners can then proactively review these drifts before a downstream impact.
On this page
Start your data observibility today
Connect your data and start generating a baseline in less than 10 minutes.