You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
description: 'Here''s what''s coming in the next few days, weeks, months, and years!'
Roadmap
Coming within a few days
Check out our Roadmap for Core on GitHub. You'll see the features we're currently working on or about to. You may also give us insights, by adding your own issues and voting for specific features / integrations.
Coming within a few weeks / months
We understand that we're not "production-ready" for a lot of companies yet. In the end, we just got started in May 2019, so we're at the beginning of the journey. Here is a highlight of the main features we are planning on releasing in the next few months:
Landing in December 2021 or so:
Release of DataSphere Portal Service GA.
Release of DataSphere Catalogue Service beta.
Release of DataSphere Migration Service beta.
Support of most popular databases as both sources and destinations.
Support of data lakes, including SnowFlake, HashData, Greenplum, GaussDB, ClickHouse, DataFuse, Hadoop .
Coming a bit later until July 2022 :
Release of DataSphere MDM Service beta.
Release of DataSphere Management Service beta.
Our goal is to become "production-ready" for any company whatever their data stack, infrastructure, architecture, data volume, and connector needs.
Coming within a few quarters / years
We also wanted to share with you how we think about the high-level roadmap over the next few months and years. We foresee several high-level phases that we will try to share here.
1. Parity on data consolidation (ELT) in warehouses / databases
Our first focus is to support batch-type ELT integrations. We feel that we can provide value right away as soon as we support one of the integrations you need. Batch integrations are also easier to build and sustain. So we would rather start with that.
Before we move on to the next phase, we want to make sure we are supporting all the major integrations and that we are in a state where we can address the long tail, with the help of the community.
We also want to fully integrate with the open-source ecosystem, including Airflow, dbt, Kubernetes, GreatExpectations, etc., so teams have the ability to fully build the data infrastructure they need.
2. Reverse-ETL from warehouses / databases
Some integrations we have in mind are batch distribution integrations, from warehouses to third-party tools. For instance, a use case could be if your marketing team wants to send back the data to your ad platforms, so it can better optimize the campaigns. Another use case could be syncing the consolidated data back to your CRM.
It’s not yet clear in our minds when to prioritize those additional integrations. We will have a better idea once we see the feedback we get from the community we build with data consolidation.
**3. Parity on data catalogue in warehouses / databases **
we will consolidate all the data types into data lakes, and build a data catalogue to help you search and analyse your data resources.
3. Cross-cloud data assets management to realize distributed data-business synergy
we will build a data consistence infrastructure to achieve data migration and synchronization cloud input and output.
4. Expand on all data engineering features
This is when we will start differentiating ourselves in terms of feature coverage with current cloud-based incumbents. Being open-sourced enables us to go faster, but also deeper.
The text was updated successfully, but these errors were encountered:
description: 'Here''s what''s coming in the next few days, weeks, months, and years!'
Roadmap
Coming within a few days
Check out our Roadmap for Core on GitHub. You'll see the features we're currently working on or about to. You may also give us insights, by adding your own issues and voting for specific features / integrations.
Coming within a few weeks / months
We understand that we're not "production-ready" for a lot of companies yet. In the end, we just got started in May 2019, so we're at the beginning of the journey. Here is a highlight of the main features we are planning on releasing in the next few months:
Landing in December 2021 or so:
Coming a bit later until July 2022 :
Our goal is to become "production-ready" for any company whatever their data stack, infrastructure, architecture, data volume, and connector needs.
Coming within a few quarters / years
We also wanted to share with you how we think about the high-level roadmap over the next few months and years. We foresee several high-level phases that we will try to share here.
1. Parity on data consolidation (ELT) in warehouses / databases
Our first focus is to support batch-type ELT integrations. We feel that we can provide value right away as soon as we support one of the integrations you need. Batch integrations are also easier to build and sustain. So we would rather start with that.
Before we move on to the next phase, we want to make sure we are supporting all the major integrations and that we are in a state where we can address the long tail, with the help of the community.
We also want to fully integrate with the open-source ecosystem, including Airflow, dbt, Kubernetes, GreatExpectations, etc., so teams have the ability to fully build the data infrastructure they need.
2. Reverse-ETL from warehouses / databases
Some integrations we have in mind are batch distribution integrations, from warehouses to third-party tools. For instance, a use case could be if your marketing team wants to send back the data to your ad platforms, so it can better optimize the campaigns. Another use case could be syncing the consolidated data back to your CRM.
It’s not yet clear in our minds when to prioritize those additional integrations. We will have a better idea once we see the feedback we get from the community we build with data consolidation.
**3. Parity on data catalogue in warehouses / databases **
we will consolidate all the data types into data lakes, and build a data catalogue to help you search and analyse your data resources.
3. Cross-cloud data assets management to realize distributed data-business synergy
we will build a data consistence infrastructure to achieve data migration and synchronization cloud input and output.
4. Expand on all data engineering features
This is when we will start differentiating ourselves in terms of feature coverage with current cloud-based incumbents. Being open-sourced enables us to go faster, but also deeper.
The text was updated successfully, but these errors were encountered: