Spark SQL based Shark CLI #341

liancheng · 2014-06-20T02:38:15Z

Merged PR #337 authored by @chenghao-intel, and then did some refactoring and update, mainly include:

Minimized CatalystContext as we don't care about response code for now, should update after SPARK-2106 is addressed.
CatalystDriver.destroy should call super.destroy.
A bunch of trivial coding style fix (including original Shark code).

Conflicts: project/SharkBuild.scala src/main/scala/shark/SharkServer2.scala

- Moved `CatalystContext` into the right package folder - Moved `getResultSetSchema` out of `CatalystContext` - Made `getResultSetSchema` to return a `Schema` rather than a `TableSchema` - `CatalystDriver.destroy` should call `super.destroy` - Coding style issues

AmplabJenkins · 2014-06-20T02:39:55Z

Merged build triggered.

AmplabJenkins · 2014-06-20T02:40:03Z

Merged build started.

AmplabJenkins · 2014-06-20T02:45:32Z

Merged build finished.

AmplabJenkins · 2014-06-20T02:45:32Z

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/Shark-Pull-Request-Builder/12211/

rxin · 2014-06-20T06:15:17Z

src/main/scala/org/apache/spark/sql/hive/CatalystContext.scala

+
+import shark.LogHelper
+
+class CatalystContext(sc: SparkContext) extends HiveContext(sc) with LogHelper {


SparkSQLContext?

or JDBCContext?

Hmm... I'd rather just call it SharkContext. Catalyst is a query optimization framework, Spark SQL is more than that, but neither of them is a concept parallel to Hive. And this class is actually not very much related to JDBC.

rxin · 2014-06-20T06:15:48Z

src/main/scala/shark/CatalystDriver.scala

+  override def getSchema: Schema = tableSchema
+
+  override def getResults(res: JArrayList[String]): Boolean = {
+    if(hiveResponse == null) {


space after if

AmplabJenkins · 2014-06-20T21:39:59Z

Merged build triggered.

AmplabJenkins · 2014-06-20T21:40:06Z

Merged build started.

AmplabJenkins · 2014-06-20T21:45:16Z

Merged build finished.

AmplabJenkins · 2014-06-20T21:45:16Z

Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/Shark-Pull-Request-Builder/12212/

liancheng · 2014-06-20T22:45:30Z

@rxin @chenghao-intel, thanks for your comments :) I've removed the alternative Hive execution mode, and addressed issues you brought up. I cleaned up some code that doesn't follow reasonable coding conventions along the way until I realized that most of it was copied from the original Hive codebase. Maybe I should just left them unchanged...

chenghao-intel and others added 25 commits June 3, 2014 16:11

[WIP]initial for sharkclidriver compabiled cli implementation

2c06e38

update the jar dependencies

5a3d9f8

Fix ClassCastException

0c2d7f6

fix bug of cli prompt when switch to hive

0477652

update readme

0afbc0f

Fix bug of getting schema info

ef29e99

Add bug info in the README

6c1d9f5

remove the mistaken commit

3d344d0

enable the cli testing

93b027f

Remove the misktaken commit

d752ed5

Add some document

6e7b4d2

Add CacheRdd reload support

3050f80

Update ReadMe for supporting the cached reload

3e652fe

Output Error Message for HQL

ca6255f

solve the netty / servlet-api jar conflict

b5c031b

Jar conflict & Work around for CliSessionState modified by HiveContext

da57ff6

remove the cached table reload for next PR

b6792db

Minimize the changes for SharkBuild.scala

bf326ff

Put the local maven as the last resolver

a3732b9

remove the unused class

02652cf

Make the unittest work

3470679

Merge remote-tracking branch 'hao/sparkSqlBack' into sparkSqlCli

93ca08a

Conflicts: project/SharkBuild.scala src/main/scala/shark/SharkServer2.scala

Asked Git to ignore downloaded SBT launch jar file

46544c7

Minimized CatalystContext as we don't care response code for now

60c4135

rxin reviewed Jun 20, 2014
View reviewed changes

liancheng added 2 commits June 20, 2014 13:18

Addressed PR comments

94c0825

Deleted unused REPL code

5a7c0a9

liancheng closed this Feb 2, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spark SQL based Shark CLI #341

Spark SQL based Shark CLI #341

liancheng commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

rxin Jun 20, 2014

marmbrus Jun 20, 2014

liancheng Jun 20, 2014

marmbrus Jun 20, 2014

rxin Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

liancheng commented Jun 20, 2014


		import shark.LogHelper

		class CatalystContext(sc: SparkContext) extends HiveContext(sc) with LogHelper {

Spark SQL based Shark CLI #341

Spark SQL based Shark CLI #341

Conversation

liancheng commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

rxin Jun 20, 2014

Choose a reason for hiding this comment

marmbrus Jun 20, 2014

Choose a reason for hiding this comment

liancheng Jun 20, 2014

Choose a reason for hiding this comment

marmbrus Jun 20, 2014

Choose a reason for hiding this comment

rxin Jun 20, 2014

Choose a reason for hiding this comment

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

AmplabJenkins commented Jun 20, 2014

liancheng commented Jun 20, 2014