来自Castled 官方的一篇性能对比
2022/1/14 6:06:07
本文主要是介绍来自Castled 官方的一篇性能对比,对大家解决编程问题具有一定的参考价值,需要的程序猿们随着小编来一起学习吧!
转自:https://medium.com/castled/fastest-reverse-etl-platform-census-vs-hightouch-vs-castled-3d2975dd4e55
Fastest Reverse ETL Platform: Census vs Hightouch vs Castled
It is not even a year since the term Reverse ETL was coined. Since then, hundreds of modern data driven organisations have completed their “data integration loop” by syncing their valuable customer and product insights from the cloud data warehouse to their business tools.
While the entire data community has accepted Reverse-ETL as the missing piece in the modern data stack, there have been few debates happening of late about the need for speed in a Reverse ETL solution. Census started the discussion when they published this blog a month ago in which they claimed to be the fastest (44x faster than all the competitors) Reverse ETL Solution. They even encouraged other solutions to publish their respective performance benchmarks using their dataset.
While we do have a different take on the need for speed in a data integration solution, we thought it would be interesting to take up this challenge.
Benchmark Specifications and Results
The BigQuery table used in the benchmark was created by importing the CSV file shared by Census. Castled took 1 min 17 seconds to sync 2.2 million records from BigQuery to Mix panel Events API.
Out of this, it took 40 seconds to query and export the query results to the GCS bucket and around 35 seconds by our data sync framework to sync the data to Mixpanel Events API.
How do we compare with Hightouch and Census
We have taken the numbers published by Census for the same benchmark for our comparison. Since Hightouch has not published any official numbers for the same, we went ahead and tried the same benchmark on Hightouch’s cloud platform.
As per our benchmarks, Castled is 642% faster than Census and 1324% faster than Hightouch. Census clocked 8 mins 15 seconds to sync the same dataset while Hightouch took 17 minutes. Castled was able to sync the data at a throughput of 29351 records/second, while Census and Hightouch were able to handle 4565 records/second and 2215 records/second respectively.
Ramping up the load
While we were at it, we ramped up the load even further and checked how Castled scales up to larger data sets. We benchmarked Castled to scale up to 100 million records. Castled took ~27 minutes to sync 100 million records to Mix panel at the rate of 63613 requests/second.
Syncing 100 million records
We also observed that our throughput increases linearly with increase in the number of records synced. Thats majorly because the average time taken to query and export a record reduces considerably, as the size of the sample set increases.
So Is Castled the fastest Reverse ETL solution ?
As per the benchmarks done, currently we are way faster than Census and Hightouch. But claiming to be the fastest Reverse ETL solution at this stage when Reverse ETL is just starting up, seems a bit immature in our opinion.
While we know that we have built a highly scalable platform at Castled, we like to believe performance is just a number. Our engineering team comprises experienced engineers who have scaled systems from zero to near infinity. So we know that other solutions can potentially scale up their platforms in a short period with significant investment in engineering.
Do not just take our word for it
You do not have to just take our word on these numbers we have published. We are open source. You should be able to spin up Castled on your desktop or deploy it on-premise in under a minute and try this out yourselves.
This is our Github Repo: https://github.com/castledio/castled. If you find it useful, do not hesitate to show your support by starring our repo.
这篇关于来自Castled 官方的一篇性能对比的文章就介绍到这儿,希望我们推荐的文章对大家有所帮助,也希望大家多多支持为之网!
- 2025-01-11国产医疗级心电ECG采集处理模块
- 2025-01-10Rakuten 乐天积分系统从 Cassandra 到 TiDB 的选型与实战
- 2025-01-09CMS内容管理系统是什么?如何选择适合你的平台?
- 2025-01-08CCPM如何缩短项目周期并降低风险?
- 2025-01-08Omnivore 替代品 Readeck 安装与使用教程
- 2025-01-07Cursor 收费太贵?3分钟教你接入超低价 DeepSeek-V3,代码质量逼近 Claude 3.5
- 2025-01-06PingCAP 连续两年入选 Gartner 云数据库管理系统魔力象限“荣誉提及”
- 2025-01-05Easysearch 可搜索快照功能,看这篇就够了
- 2025-01-04BOT+EPC模式在基础设施项目中的应用与优势
- 2025-01-03用LangChain构建会检索和搜索的智能聊天机器人指南