feat(error): preserve error's source chain across gRPC boundary #13282

Merged 15 commits on Nov 10, 2023
26 changes: 26 additions & 0 deletions Cargo.lock

2 changes: 2 additions & 0 deletions Cargo.toml
@@ -11,6 +11,7 @@ members = [
"src/compute",
"src/connector",
"src/ctl",
"src/error",
"src/expr/core",
"src/expr/impl",
"src/expr/macro",
@@ -138,6 +139,7 @@ risingwave_compactor = { path = "./src/storage/compactor" }
risingwave_compute = { path = "./src/compute" }
risingwave_ctl = { path = "./src/ctl" }
risingwave_connector = { path = "./src/connector" }
risingwave_error = { path = "./src/error" }
risingwave_expr = { path = "./src/expr/core" }
risingwave_expr_impl = { path = "./src/expr/impl" }
risingwave_frontend = { path = "./src/frontend" }
3 changes: 2 additions & 1 deletion src/batch/src/error.rs
@@ -17,6 +17,7 @@ use std::sync::Arc;
pub use anyhow::anyhow;
use risingwave_common::array::ArrayError;
use risingwave_common::error::{ErrorCode, RwError};
use risingwave_rpc_client::error::ToTonicStatus;
use thiserror::Error;
use tonic::Status;

@@ -79,6 +80,6 @@ impl From<RwError> for BatchError {

impl<'a> From<&'a BatchError> for Status {
fn from(err: &'a BatchError) -> Self {
-Status::internal(err.to_string())
err.to_status(tonic::Code::Internal)
}
}
1 change: 1 addition & 0 deletions src/common/Cargo.toml
@@ -73,6 +73,7 @@ rand = "0.8"
regex = "1"
reqwest = { version = "0.11", features = ["json"] }
risingwave_common_proc_macro = { path = "./proc_macro" }
risingwave_error = { workspace = true }
risingwave_pb = { workspace = true }
rust_decimal = { version = "1", features = ["db-postgres", "maths"] }
ryu = "1.0"
66 changes: 38 additions & 28 deletions src/common/src/error.rs
@@ -20,17 +20,14 @@ use std::io::Error as IoError;
use std::time::{Duration, SystemTime};

use memcomparable::Error as MemComparableError;
use risingwave_error::tonic::{ToTonicStatus, TonicStatusWrapper};
use risingwave_pb::PbFieldNotFound;
use thiserror::Error;
use tokio::task::JoinError;
-use tonic::Code;

use crate::array::ArrayError;
use crate::util::value_encoding::error::ValueEncodingError;

-/// Header used to store serialized [`RwError`] in grpc status.
-pub const RW_ERROR_GRPC_HEADER: &str = "risingwave-error-bin";
-
const ERROR_SUPPRESSOR_RESET_DURATION: Duration = Duration::from_millis(60 * 60 * 1000); // 1h

pub trait Error = std::error::Error + Send + Sync + 'static;
@@ -126,10 +123,10 @@ pub enum ErrorCode {
#[source]
BoxedError,
),
-#[error("RPC error: {0}")]
#[error(transparent)]
RpcError(
#[source]
-#[backtrace]
// #[backtrace] // TODO(error-handling): there's a limitation that `#[transparent]` can't be used with `#[backtrace]` if no `#[from]`
// `tonic::transport::Error`, `TonicStatusWrapper`, or `RpcError`
BoxedError,
),
#[error("Bind error: {0}")]
@@ -195,12 +192,41 @@ pub struct RwError {

impl From<RwError> for tonic::Status {
fn from(err: RwError) -> Self {
-match &*err.inner {
-ErrorCode::ExprError(e) => tonic::Status::invalid_argument(e.to_string()),
-ErrorCode::PermissionDenied(e) => tonic::Status::permission_denied(e),
-ErrorCode::InternalError(e) => tonic::Status::internal(e),
-_ => tonic::Status::internal(err.to_string()),
use tonic::Code;

let code = match &*err.inner {
ErrorCode::ExprError(_) => Code::InvalidArgument,
Review comment (Member):

ExprError => InvalidArgument doesn't look very correct 🤣

BTW, I've always wondered whether we should map application error codes to tonic status codes at all. This is like the debate over whether to use HTTP status codes directly or to treat them as a mere wrapper and do everything in the payload. I don't have an answer; I just want to mention it. Let's keep it as is until somebody is unhappy.

Reply (Member Author):

I agree. I've just left the behavior unchanged in this PR. 😕

> whether we should map application error code to tonic status code.

+1. And the only place we refer to it is the error message of Status. 😄

ErrorCode::PermissionDenied(_) => Code::PermissionDenied,
ErrorCode::InternalError(_) => Code::Internal,
_ => Code::Internal,
};

err.to_status(code)
}
}

impl From<TonicStatusWrapper> for RwError {
fn from(status: TonicStatusWrapper) -> Self {
use tonic::Code;

let message = status.inner().message();

// TODO(error-handling): `message` loses the source chain.
match status.inner().code() {
Code::InvalidArgument => ErrorCode::InvalidParameterValue(message.to_string()),
Code::NotFound | Code::AlreadyExists => ErrorCode::CatalogError(status.into()),
Code::PermissionDenied => ErrorCode::PermissionDenied(message.to_string()),
Code::Cancelled => ErrorCode::SchedulerError(status.into()),
_ => ErrorCode::RpcError(status.into()),
}
.into()
}
}

impl From<tonic::Status> for RwError {
fn from(status: tonic::Status) -> Self {
// Always wrap the status.
Self::from(TonicStatusWrapper::new(status))
}
}

@@ -292,22 +318,6 @@ impl From<PbFieldNotFound> for RwError {
}
}

-impl From<tonic::Status> for RwError {
-fn from(err: tonic::Status) -> Self {
-match err.code() {
-Code::InvalidArgument => {
-ErrorCode::InvalidParameterValue(err.message().to_string()).into()
-}
-Code::NotFound | Code::AlreadyExists => {
-ErrorCode::CatalogError(err.message().to_string().into()).into()
-}
-Code::PermissionDenied => ErrorCode::PermissionDenied(err.message().to_string()).into(),
-Code::Cancelled => ErrorCode::SchedulerError(err.message().to_string().into()).into(),
-_ => ErrorCode::InternalError(err.message().to_string()).into(),
-}
-}
-}
-
impl From<tonic::transport::Error> for RwError {
fn from(err: tonic::transport::Error) -> Self {
ErrorCode::RpcError(err.into()).into()
22 changes: 22 additions & 0 deletions src/error/Cargo.toml
@@ -0,0 +1,22 @@
[package]
name = "risingwave_error"
version.workspace = true
edition.workspace = true
homepage.workspace = true
keywords.workspace = true
license.workspace = true
repository.workspace = true

# See more keys and their definitions at https://doc.rust-lang.org/cargo/reference/manifest.html

[dependencies]
bincode = "1"
bytes = "1"
easy-ext = "1"
serde-error = "0.1"
thiserror = "1"
thiserror-ext = { workspace = true }
tonic = { workspace = true }

[lints]
workspace = true
19 changes: 19 additions & 0 deletions src/error/src/lib.rs
@@ -0,0 +1,19 @@
// Copyright 2023 RisingWave Labs
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

//! Error handling utilities.
//!
//! This will eventually replace the `RwError` in `risingwave_common`.

pub mod tonic;
123 changes: 123 additions & 0 deletions src/error/src/tonic.rs
@@ -0,0 +1,123 @@
// Copyright 2023 RisingWave Labs
//
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.

use std::error::Error;
use std::sync::Arc;

#[easy_ext::ext(ToTonicStatus)]
impl<T> T
where
T: ?Sized + std::error::Error,
{
/// Convert the error to [`tonic::Status`] with the given [`tonic::Code`].
///
/// The source chain is preserved by pairing with [`TonicStatusWrapper`].
// TODO(error-handling): disallow constructing `tonic::Status` directly with `new` by clippy.
pub fn to_status(&self, code: tonic::Code) -> tonic::Status {
// Embed the whole error (`self`) and its source chain into the details field.
// At the same time, set the message field to the error message of `self` (without source chain).
// The redundancy of the current error's message is intentional in case the client ignores the `details` field.
let source = serde_error::Error::new(self);
let details = bincode::serialize(&source).unwrap_or_default();

let mut status = tonic::Status::with_details(code, self.to_string(), details.into());
// Set the source of `tonic::Status`, though it's not likely to be used.
// This is only available before serializing to the wire. That's why we need to manually embed it
// into the `details` field.
status.set_source(Arc::new(source));
status
}
}

/// A wrapper of [`tonic::Status`] that provides better error message and extracts
/// the source chain from the `details` field.
#[derive(Debug)]
pub struct TonicStatusWrapper(tonic::Status);

impl TonicStatusWrapper {
/// Create a new [`TonicStatusWrapper`] from the given [`tonic::Status`] and extract
/// the source chain from its `details` field.
pub fn new(mut status: tonic::Status) -> Self {
if status.source().is_none() {
if let Ok(e) = bincode::deserialize::<serde_error::Error>(status.details()) {
status.set_source(Arc::new(e));
}
}
Self(status)
}

/// Returns the reference to the inner [`tonic::Status`].
pub fn inner(&self) -> &tonic::Status {
&self.0
}

/// Consumes `self` and returns the inner [`tonic::Status`].
pub fn into_inner(self) -> tonic::Status {
self.0
}
}

impl From<tonic::Status> for TonicStatusWrapper {
fn from(status: tonic::Status) -> Self {
Self::new(status)
}
}

impl std::fmt::Display for TonicStatusWrapper {
fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
write!(f, "gRPC error ({}): {}", self.0.code(), self.0.message())
}
}

impl std::error::Error for TonicStatusWrapper {
fn source(&self) -> Option<&(dyn std::error::Error + 'static)> {
// Delegate to `self.0` as if we're transparent.
self.0.source()
}
}

#[cfg(test)]
mod tests {
use super::*;

#[test]
fn test_ui() {
#[derive(thiserror::Error, Debug)]
#[error("{message}")]
struct MyError {
message: &'static str,
source: Option<Box<MyError>>,
}

let original = MyError {
message: "outer",
source: Some(Box::new(MyError {
message: "inner",
source: None,
})),
};

let server_status = original.to_status(tonic::Code::Internal);
let body = server_status.to_http();
let client_status = tonic::Status::from_header_map(body.headers()).unwrap();

let wrapper = TonicStatusWrapper::new(client_status);
assert_eq!(wrapper.to_string(), "gRPC error (Internal error): outer");

let source = wrapper.source().unwrap();
assert!(source.is::<serde_error::Error>());
assert_eq!(source.to_string(), "outer");
assert_eq!(source.source().unwrap().to_string(), "inner");
}
}
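
Note on the round trip exercised by `test_ui`: the source chain only survives the wire because it is flattened into owned data, shipped as bytes in `details`, and relinked on the client. The following is a minimal std-only sketch of that idea, not the PR's implementation. `ChainError`, `encode_chain`, `decode_chain`, and the length-prefixed byte layout are all hypothetical stand-ins for `serde_error::Error` and `bincode`, chosen so the sketch needs no external crates.

```rust
use std::error::Error;
use std::fmt;

/// Owned snapshot of one error in a chain.
/// Hypothetical stand-in for `serde_error::Error`.
#[derive(Debug)]
struct ChainError {
    message: String,
    source: Option<Box<ChainError>>,
}

impl fmt::Display for ChainError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_str(&self.message)
    }
}

impl Error for ChainError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        self.source.as_deref().map(|e| e as &(dyn Error + 'static))
    }
}

/// Append one message as a little-endian `u32` length followed by UTF-8 bytes.
fn push_message(bytes: &mut Vec<u8>, message: &str) {
    bytes.extend_from_slice(&(message.len() as u32).to_le_bytes());
    bytes.extend_from_slice(message.as_bytes());
}

/// Server side: flatten `err` and all of its sources, outermost first.
/// The returned bytes play the role of the `details` field.
fn encode_chain(err: &dyn Error) -> Vec<u8> {
    let mut bytes = Vec::new();
    push_message(&mut bytes, &err.to_string());
    let mut source = err.source();
    while let Some(e) = source {
        push_message(&mut bytes, &e.to_string());
        source = e.source();
    }
    bytes
}

/// Client side: relink the chain. Returns `None` for empty or malformed
/// input, so the caller can fall back to a status without a source.
fn decode_chain(mut bytes: &[u8]) -> Option<ChainError> {
    let mut messages = Vec::new();
    while !bytes.is_empty() {
        if bytes.len() < 4 {
            return None;
        }
        let len = u32::from_le_bytes([bytes[0], bytes[1], bytes[2], bytes[3]]) as usize;
        let rest = &bytes[4..];
        if rest.len() < len {
            return None;
        }
        let (message, rest) = rest.split_at(len);
        messages.push(String::from_utf8(message.to_vec()).ok()?);
        bytes = rest;
    }
    // Link from the innermost error outwards.
    let mut chain: Option<ChainError> = None;
    for message in messages.into_iter().rev() {
        chain = Some(ChainError { message, source: chain.map(Box::new) });
    }
    chain
}
```

As in `TonicStatusWrapper::new`, a payload that fails to decode simply leaves the error without a source rather than failing the whole conversion.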
2 changes: 1 addition & 1 deletion src/frontend/src/scheduler/error.rs
@@ -26,7 +26,7 @@ pub enum SchedulerError {
#[error("Pin snapshot error: {0} fails to get epoch {1}")]
PinSnapshot(QueryId, u64),

-#[error("Rpc error: {0}")]
#[error(transparent)]
RpcError(
#[from]
#[backtrace]
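
Note on the switch to `#[error(transparent)]`: the diff does not state the motivation, but one likely reason is that a `"Rpc error: {0}"` format combined with a source field puts the inner message both in `Display` and in the source chain, so any reporter that walks the chain prints it twice. The sketch below hand-writes roughly what the two attribute styles amount to, using only std. `Leaf`, `Prefixed`, `Transparent`, and `report` are hypothetical names for illustration, and the real `thiserror` expansion differs in detail.

```rust
use std::error::Error;
use std::fmt;

/// A leaf error with no source of its own.
#[derive(Debug)]
struct Leaf(&'static str);

impl fmt::Display for Leaf {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_str(self.0)
    }
}

impl Error for Leaf {}

/// Roughly what `#[error("Rpc error: {0}")]` with a source field gives:
/// the inner message is embedded in `Display` AND returned from `source()`.
#[derive(Debug)]
struct Prefixed(Leaf);

impl fmt::Display for Prefixed {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "Rpc error: {}", self.0)
    }
}

impl Error for Prefixed {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        Some(&self.0)
    }
}

/// Roughly what `#[error(transparent)]` gives: both `Display` and
/// `source()` are forwarded to the inner error.
#[derive(Debug)]
struct Transparent(Leaf);

impl fmt::Display for Transparent {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        fmt::Display::fmt(&self.0, f)
    }
}

impl Error for Transparent {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        self.0.source()
    }
}

/// Join an error and all of its sources with ": ", outermost first.
fn report(err: &dyn Error) -> String {
    let mut out = err.to_string();
    let mut source = err.source();
    while let Some(e) = source {
        out.push_str(": ");
        out.push_str(&e.to_string());
        source = e.source();
    }
    out
}
```

With these definitions, `report(&Prefixed(Leaf("connection refused")))` yields `Rpc error: connection refused: connection refused`, while the transparent wrapper yields just `connection refused`.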