Which approach is better for data integration
I see that many companies are currently using python as their ETL tool. I came from PDI (Pentaho Data Integration), SSIS and other ETL tools. How performance-efficient does Python offer compared to the above tools?
Currently My approach to data integration
- If the source is any storage like Mysql, MSSQL, Salesforce API, Google Spreadsheet, CSV file, Nosql DB, I prefer ETL PDI tool for data integration.
- If the source is any API like Graphana, Humanity or API, and also for a dirty data source file like CSV, then I prefer Python
Is my approach correct?
+3
source to share
2 answers