
Apache Spark - Docs refactoring #2789

Merged: 12 commits merged into ClickHouse:main on Nov 21, 2024

Conversation

BentsiLeviav (Contributor):

As part of our effort to improve Spark's documentation, this PR includes:

  • Split the big file we currently have into two separate documentation pages.
  • Add code examples in Java, Scala, PySpark, and Spark SQL to the native connector page.
  • Reorganize the native connector doc page.

@BentsiLeviav BentsiLeviav requested a review from a team as a code owner November 13, 2024 17:06
@BentsiLeviav BentsiLeviav requested review from kitop and a team and removed request for a team November 13, 2024 17:06
@mshustov mshustov requested review from mzitnik and laeg and removed request for kitop and a team November 14, 2024 13:53
import TOCInline from '@theme/TOCInline';

# Spark JDBC
One of the most commonly used data sources supported by Spark is JDBC.
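As a minimal, illustrative sketch (not part of this PR) of what such a JDBC read can look like in Scala: it assumes the clickhouse-jdbc driver is on the classpath, and the URL, table name, and credentials below are placeholders, not recommendations.

```scala
import org.apache.spark.sql.SparkSession

object ClickHouseJdbcReadSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("clickhouse-jdbc-read")
      .master("local[*]")
      .getOrCreate()

    // Read a ClickHouse table through Spark's generic JDBC data source.
    // Connection details are placeholders for a locally running ClickHouse.
    val df = spark.read
      .format("jdbc")
      .option("driver", "com.clickhouse.jdbc.ClickHouseDriver")
      .option("url", "jdbc:clickhouse://localhost:8123/default")
      .option("dbtable", "my_table")
      .option("user", "default")
      .option("password", "")
      .load()

    df.show()
    spark.stop()
  }
}
```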
mshustov (Member):

Do we need to provide any specific recommendations for the JDBC driver version?

BentsiLeviav (Contributor, Author):

Do you have a specific version in mind?
@mzitnik, is there a specific version we recommend?

mzitnik (Contributor):

@mshustov, did you mean which version of the ClickHouse JDBC driver we should recommend?

mshustov (Member), Nov 20, 2024:

> did you mean which version of the ClickHouse JDBC driver we should recommend?

Yes.

The above examples demonstrate SparkSQL queries, which you can run within your application using any API—Java, Scala, PySpark, or shell.
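As an illustration of that point (a sketch only, with a placeholder table identifier), the same statements can be submitted programmatically through `spark.sql`:

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
  .appName("spark-sql-from-code")
  .master("local[*]")
  .getOrCreate()

// Any Spark SQL statement shown above can be passed to spark.sql(...).
// "my_table" is a placeholder for whatever table the surrounding examples define.
val result = spark.sql("SELECT * FROM my_table LIMIT 10")
result.show()
```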


## Supported Data Types
Member:

Could you extend the docs with the following sections:

:::important
It's essential to include the [clickhouse-jdbc JAR](https://mvnrepository.com/artifact/com.clickhouse/clickhouse-jdbc) with the "all" classifier,
as the connector relies on [clickhouse-http](https://mvnrepository.com/artifact/com.clickhouse/clickhouse-http-client) and [clickhouse-client](https://mvnrepository.com/artifact/com.clickhouse/clickhouse-client) —both of which are bundled in clickhouse-jdbc:all.
Alternatively, you can add the [clickhouse-client JAR](https://mvnrepository.com/artifact/com.clickhouse/clickhouse-client) and [clickhouse-http](https://mvnrepository.com/artifact/com.clickhouse/clickhouse-http-client) individually if you prefer not to use the full JDBC package.
:::
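For instance, in an sbt build the dependency with the "all" classifier might be declared as sketched below; the version string is a placeholder, so check Maven Central for the current release:

```scala
// build.sbt sketch: pull in clickhouse-jdbc with the "all" classifier.
// "<version>" is a placeholder, not a recommended version.
libraryDependencies += "com.clickhouse" % "clickhouse-jdbc" % "<version>" classifier "all"
```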
Contributor:

IMO, giving too many alternatives can cause confusion.

@BentsiLeviav merged commit c2dd4a0 into ClickHouse:main on Nov 21, 2024
2 of 3 checks passed