creating a dataframe for a delta table between two versions to use with merge #2409
Unanswered
krishna-prasad-s
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm working on a merge activity where I want to replicate the checkpoint behaviour of structured spark streaming.
I am working with delta tables and I have access to table versioning.
I am thinking to apply the version number as an alternative to structured spark streaming's checkpoint.
can we create a dataframe (to be used with the merge function) buy querying between versions.
for e.g.
pub async fn get_df_from_version(&mut self, table_name: &str, version: i64) -> DataFrame { let sql = &format!("SELECT * FROM {} VERSION BETWEEN {} AND current", table_name, version); self.ctx.sql(sql).await.unwrap() }
or is there any better alternative?
Beta Was this translation helpful? Give feedback.
All reactions