-
Notifications
You must be signed in to change notification settings - Fork 381
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: Create an example API. #3869
base: main
Are you sure you want to change the base?
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,39 @@ | ||
syntax = "proto3"; | ||
|
||
package jj_api.objects; | ||
|
||
// Leaving this as a proto in case we come up with some sort of revset-style | ||
// syntax for operations. | ||
message OperationRef { | ||
string id = 1; | ||
} | ||
|
||
enum Snapshot { | ||
SNAPSHOT_UNSPECIFIED = 0; | ||
NO_SNAPSHOT = 1; | ||
SNAPSHOT = 2; | ||
} | ||
|
||
message GlobalOptions { | ||
// Generally prefer providing working_dir over repo_path, since working_dir | ||
// allows jj to infer the path if you're not at the repo root. | ||
string working_dir = 1; | ||
string repo_path = 2; | ||
|
||
// Technically this could be a boolean, but this forces users to choose either | ||
// SNAPSHOT or NO_SNAPSHOT. | ||
Snapshot snapshot = 3; | ||
|
||
OperationRef operation = 4; | ||
|
||
// If true, user configuration will be loaded. In particular: | ||
// * User revset aliases will be used | ||
// * User template aliases will be used | ||
bool use_user_config = 5; | ||
|
||
// Path to additional config files. | ||
// Will be applied after the user configuration, if use_user_config is set. | ||
repeated string extra_configs = 6; | ||
|
||
bool ignore_immutable = 7; | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,70 @@ | ||
syntax = "proto3"; | ||
|
||
package jj_api.objects; | ||
|
||
// A RevisionMask controls how much of a revision is returned. | ||
// For example, `jj log` will likely not care about files, while `jj status` | ||
// will want to know what files changed in a revision, and `jj diff` will want | ||
// to know both the content of the file and the content of the file in the | ||
// parent revision. | ||
message RevisionMask { | ||
// Which files will be returned. | ||
enum FilePathMask { | ||
NONE = 0; | ||
MODIFIED_FILES = 1; | ||
ALL_FILES = 3; | ||
// Include all files that were matched by the RevisionMask containing this | ||
// RevisionMask. For example, if we had: | ||
// RevisionMask( | ||
// files_to_include = MODIFIED_FILES, | ||
// files = FileMask(content = True), | ||
// parents = RevisionMask( | ||
// files_to_include = PARENT_FILES | ||
// files = FileMask(content = True), | ||
// ), | ||
// ) | ||
// And we returned the revision @, which modfied the file foo, then both | ||
// r.files["foo"].content and r.parent.files["foo"].content would be filled. | ||
PARENT_FILES = 4; | ||
}; | ||
FilePathMask files_to_include = 1; | ||
repeated string additional_files = 2; | ||
|
||
bool rendered = 3; | ||
|
||
bool file_content = 4; | ||
bool file_hash = 5; | ||
bool file_metadata = 6; | ||
|
||
// How much of the parent revision to fill in. | ||
RevisionMask parents = 7; | ||
} | ||
|
||
message File { | ||
// If you query for MODIFIED_FILES, you may get a file that's been deleted. | ||
bool exists = 1; | ||
string hash = 2; | ||
bytes content = 3; | ||
// TODO: file metadata (eg. permissions)? | ||
} | ||
|
||
|
||
// By default, | ||
message Revision { | ||
string change_id = 1; | ||
string commit_id = 2; | ||
string description = 3; | ||
bool empty = 4; | ||
bool conflicts = 5; | ||
bool mutable = 6; | ||
|
||
// The rendered template. | ||
// Open question: Is a template a first-class citizen of jj, or is it specific | ||
// to jj-cli? | ||
string rendered = 7; | ||
Comment on lines
+62
to
+64
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Imo, template rendering should be a first-class citizen of the jj client/CLI. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Pulling @martinvonz's comment into here to discuss further in a single thread.
Seems the two of you are disagreeing here, so I'll list what I think the consequences are of each option:
After thinking about it in more detail, I think that adding it as a first-class citizen is probably my preferred choice. I can imagine that, for example, an extension such as
And instead print:
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
I thought the conclusion from the design doc was that the RPC is basically for adding new UIs while the Rust API is usually required for adding new commands. Did I misunderstand/misremember? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This sounds like some miscommunication went on. I suspect that my My assumption was that:
For example, if we chose to implement gerrit support as an extension rather than natively in jj, I assumed that it might look like (very pseudocode): #!/usr/bin/env python3
import jj
args = argparse.ArgumentParser(...).parse_args()
client = jj.client()
transaction = client.StartTransaction()
rev = client.GetRevision(jj.RevisionRequest(transaction=transaction, revset=args.revset))
rev = client.UpdateDescription(transaction=transaction, revision=rev, description=client.description + "Change-ID: ...")
client.EndTransaction(transaction)
subcommand.run(["git", "push", "--remote", "gerrit", f"{rev.commit_id}:refs/for/main"]
print(f"Sent, {rev} to gerrit for review") My assumption In extension would be a binary external to There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
I think API clients should have access to the templates, but as Martin says this may have an perf overhead and it also isn't clear on how to expose them correctly. I think a lightweight variant of them should be sensible for now. The reason for that is, like I've written in the doc to discourage
My opinion on that is that we need general commit metadata anyways, so RE: conclusion conversation, in my mind it isn't done yet so I'd be nice have a conclusion there. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
My thoughts was that you would execute a If you executed
I think you may have misunderstood what my point was here. I was simply saying that an arbitrary extension may want access to templates to change how commits are printed. I was simply suggesting that the user might want to change the template from |
||
|
||
map<string, File> files = 8; | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think this should have a specific RepoPath type, as could model the whole rename tracing API a bit better. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What would a RepoPath type look like? I think it's a good point that we should track renames, but I'm not convinced it should be in the key of the map, since that means the user would need to write something like How would you feel about tracking renames with additional metadata in the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think a There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Sounds reasonable to me There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Basically any server, remote or local would benefit from a restricted type with some guarantees. The current design could mean that you can escape the repo/workspace which isn't something we want to support anyway. |
||
|
||
// revision.parents[*].parents is always empty, to avoid recursion. | ||
repeated Revision parents = 9; | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,72 @@ | ||
syntax = "proto3"; | ||
|
||
package jj_api.objects; | ||
|
||
message Revset { | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This should be named BuiltinRevset or so for the first class types and then just Revset for the user input. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not a huge fan of that idea. The user input is just a string, so doesn't need its own type, and I think the term @martinvonz WDYT? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If we allow revsets strings here, then we presumably allow nesting revset strings inside other revset nodes (e.g. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Then how do custom downstreams implement builtins? Imagine if Google implements a There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. My assumption was that the custom downstream would just use Alternatively, we could just add a type for a custom function call.
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Yeah, that definitely works.
It would allow downstreams to add it to their "BuiltinRevset" enumeration, although this could also be problematic if it leads to tag overlaps.
I like that solution, but it is very low-level and I'm not sure if that aligns with the stated goal. |
||
oneof kind { | ||
// Should ideally only be used when the user types in their own revset. | ||
// Otherwise, construct it using this proto. | ||
string revset = 1; | ||
Comment on lines
+7
to
+9
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Does this resolve revset aliases? If it does, it implies the server will have to know how to read the user's config. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. (I wrote the below before your reply above) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think it should. If an extension ever wants to take input from a user, I think that the user would likely have the assumption that the revset will work correctly with their aliases. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think there's a reasonable middle ground where we could take in a boolean There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I've added |
||
|
||
// Primitives that we can look up directly. | ||
string change_id = 2; | ||
string commit_id = 3; | ||
|
||
// Operators | ||
Union union = 4; | ||
Intersection intersection = 5; | ||
Revset not = 6; | ||
Between between = 7; | ||
|
||
// Functions | ||
Revset parents = 8; | ||
Revset children = 9; | ||
Ancestors ancestors = 10; | ||
// Descendants is skipped (use Between(from=x, to=None, inclusive=True) | ||
Reachable reachable = 12; | ||
// Connected is skipped (use Between(from=x, to=x, inclusive=True) | ||
bool all = 13; // Use bool for a function with no parameters | ||
bool none = 14; // Use bool for a function with no parameters | ||
Branches branches = 15; | ||
RemoteBranches remote_branches = 16; | ||
|
||
// You get the point. Each builtin function is implemented as a oneof in | ||
// this message. | ||
} | ||
} | ||
|
||
message Union { | ||
repeated Revset srcs = 1; | ||
} | ||
|
||
message Intersection { | ||
repeated Revset srcs = 1; | ||
} | ||
|
||
// Equivalent to from::to when inclusive is set, or from..to when not set. | ||
message Between { | ||
optional Revset from = 1; | ||
optional Revset to = 2; | ||
// If false, equivalent to .. | ||
// If true, equivalent to :: | ||
bool inclusive = 3; | ||
} | ||
|
||
message Ancestors { | ||
Revset srcs = 1; | ||
int32 depth = 2; | ||
} | ||
|
||
message Reachable { | ||
Revset srcs = 1; | ||
Revset domain = 2; | ||
} | ||
|
||
message Branches { | ||
string pattern = 1; | ||
} | ||
|
||
message RemoteBranches { | ||
string branch_pattern = 1; | ||
string remote_pattern = 2; | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,52 @@ | ||
syntax = "proto3"; | ||
|
||
package jj_api.rpc; | ||
|
||
import "objects/options.proto"; | ||
import "objects/revset.proto"; | ||
import "objects/revision.proto"; | ||
|
||
message RevisionsRequest { | ||
// Since this is a read-only request, we don't want to require the user to | ||
// start a transaction. | ||
oneof state { | ||
string transaction_id = 1; | ||
jj_api.objects.GlobalOptions repo_state = 2; | ||
} | ||
|
||
jj_api.objects.RevisionMask revision_mask = 3; | ||
|
||
jj_api.objects.Revset revisions = 4; | ||
|
||
int32 limit = 5; | ||
// TODO: We have three options here: | ||
// 1) Make this a string, and return the string | ||
// 2) Make this similar to Revset, where we can construct it | ||
// 3) Remove this from the API and put it in jj-cli entirely. Templates would, | ||
// instead of formatting a Commit object, format a Revision proto. | ||
Comment on lines
+25
to
+26
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I generally don't like exposing the actual storage type, so a wrapper is my preferred option. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. To clarify, you're saying that you'd prefer that a I'm a little confused because I had assumed that it was mutually exclusive with making templates a first-class citizen of the jj client. Could you explain what you're hoping for in more detail My assumption was that either templates would format a Commit object inside the daemon, and be treated as a first-class citizen:
Or templates would be purely for the use of
And that in this second scenario, they would be unable to be a first-class citizen of the jj-client, since a python extension, for example, would be unable to use the rust-native implementation of In the second scenario, we definitely aren't exposing the actual storage type, but in the first scenario, templates work the same way they currently do, and I don't think that would involve exposing the actual storage type either (the end-user would just receive a string). There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
I'm saying that the API server should have a distinct definition of what is stored, as not everyone will want a full commit. See #2988 for my thoughts on that, because I think Google wants to run For the other part I'm gonna answer upthread. |
||
// I don't really like option 2, since it seems that the caller of this could | ||
// always just do that themselves in whatever the language is calling this | ||
// API. | ||
// Option 1 and Option 3 both seem reasonable, and mainly depend on whether we | ||
// think that jj-cli will be the only user of templates. | ||
// I personally am leaning towards option 3, because I think that an extension | ||
// may quite reasonably want to, for example, take advantage of a user's | ||
// jj config file containing custom templates or template aliases. | ||
string template = 6; | ||
} | ||
|
||
message ListRevisionsResponse { | ||
repeated jj_api.objects.Revision revisions = 1; | ||
|
||
// This is useful if you want to perform, for example, `jj diff` between two | ||
// revisions. | ||
string operation_id = 2; | ||
} | ||
|
||
message GetRevisionResponse { | ||
jj_api.objects.Revision revision = 1; | ||
|
||
// This is useful if you want to perform, for example, `jj diff` between two | ||
// revisions. | ||
string operation_id = 2; | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,25 @@ | ||
syntax = "proto3"; | ||
|
||
package jj_api.rpc; | ||
|
||
import "objects/options.proto"; | ||
|
||
message BeginTransactionRequest { | ||
jj_api.objects.GlobalOptions state = 1; | ||
|
||
// If EndTransaction is not called within timeout_seconds, then the daemon | ||
// will abandon the transaction. | ||
uint32 timeout_seconds = 2; | ||
} | ||
|
||
message BeginTransactionResponse { | ||
string transaction_id = 1; | ||
} | ||
|
||
message EndTransactionRequest { | ||
string transaction_id = 1; | ||
} | ||
|
||
message EndTransactionResponse { | ||
string operation_id = 1; | ||
} |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,25 @@ | ||
syntax = "proto3"; | ||
|
||
import "rpc/read_revisions.proto"; | ||
import "rpc/transaction.proto"; | ||
|
||
package jj_api.services; | ||
|
||
// All jj things will go into this service. | ||
service JjService { | ||
rpc BeginTransaction(jj_api.rpc.BeginTransactionRequest) returns (jj_api.rpc.BeginTransactionResponse) {} | ||
rpc EndTransaction(jj_api.rpc.EndTransactionRequest) returns (jj_api.rpc.EndTransactionResponse) {} | ||
|
||
// This RPC will be used by `jj log` | ||
rpc ListRevisions(jj_api.rpc.RevisionsRequest) returns (jj_api.rpc.ListRevisionsResponse) {} | ||
|
||
// This RPC will be used by: | ||
// * cat | ||
// * diff | ||
// * files | ||
// * show | ||
// * status | ||
// It only differs from `ListRevisions` in that it throws an error if there | ||
// isn't exactly one match. | ||
rpc GetRevision(jj_api.rpc.RevisionsRequest) returns (jj_api.rpc.GetRevisionResponse) {} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Interesting question! My gut feeling is that templating should not be in the API.
However, if we return all the data any caller might need, it might get unnecessarily slow. For example, it's currently expensive to check if merge commits are empty (because we check that by merging the parents' trees). So we may want to be able to indicate which data we want back. There may be some value in being able to make that conditional like you can in a template (e.g. only care whether a commit is empty if it's mutable). That suggests something like templates, but I'm not sure it will be needed. Maybe callers that care enough can send multiple requests instead (e.g. send an additional request for each mutable commit). Even if we provide some support for conditionals like that, I don't think we should return a rendered string. The the caller might have to parse it. It seems better to return it in some structured form.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm open to adding fields to the
RevisionMask
proto.I'm not a huge fan of this, this seems unnecessarily complex, and I can't see a use case for it. I think that, for example, if the user wanted to retrieve the diff, and thus only wanted to know if the current commit was empty, but needed the parent commit, then they can simply do a
GetRevision
on the current revision, and add a mask for the parents so that the parents and the revision you actually care about return different sets of data.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was thinking of the current CLI as an example user of the API, but I think our conclusion from the design doc was that the current CLI needs more control than we expect to provide via the RPC API.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah, sounds like we both understood the conclusion differently here. I understood our conclusion to be that the current CLI currently needs more control than we can expect to provide via the RPC API, but the conclusion I drew was that we would have to do significant refactorings to the current CLI in order to change that. I think, however, that your conclusion is probably better for the short-term, and for now we shouldn't try to migrate jj-cli over.
That being said, I think we should design this with the assumption that if jj-cli needs more control than we expect to provide, then other users may need that as well. Thus, I think we should design this API with the intent that if jj-cli requires more than this API can provide right now, we should attempt to design it so that we can at least add those features to the API in the future so we could migrate.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we're ever going to get to a point where the RPC API is basically as flexible as the Rust API, I would recommend looking at e.g.
jj parallelize
and rewrite that using an hypothetical RPC API (i.e. no need to actually implement the RPC API). I'm not yet convinced that it's feasible to get the RPC API to that point, but I'm also not convinced that it's not. Using some moderately complex existing command likejj parallelize
seems like a good way of figuring whether it is.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The parts I were thinking of as hard were the transaction management and the callback in
transform_descendants()
. I think we previously said the API would be stateless. I see that that has now changed with the transaction methods. Callbacks might be implementable using streaming RPCs. Maybe we'll implement some such state management usingRefCell
and it won't be much of a problem. I'm sure it's all solvable.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I originally planned for the API to be stateless, because I assumed that the transaction would take a lock and didn't want an extension crashing or something similar blocking something else from running.
I'm honestly on the fence and wanted to experiment with both options.
What I like about stateful RPCs
What I like about a 1:1 mapping of RPC <-> jj command
jj parallelize
themselvesI think it's mostly a question of power vs simplicity. Low-level operations as commands give you a lot of power, but are far more difficult to use. I think, to be honest, looking at it now, we need those high-level operations (if a user had to implement parallelize themselves, they're just gonna use the jj cli instead of the jj api).
We could potentially implement both sets, with the high-level commands written in terms of those low-level commands, and I think that's probably the best option, but that will introduce a maintainence burden (not sure how much).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we take the approach of both providing a low-level API and a CLI like API, they should be separate services with different guarantees to not have such a large API surface on a single service.
I agree but think that you still should put your sketch of the high level API up, even if it gets superseded by the low-level API for this PR.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree that we should focus on the high-level API for now. The level I'm thinking of would be roughly matching the CLI. The purpose is to make it easier to implement UIs and IDE extensions.
We can add lower-level functionality later IMO.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't really care about the design as long as it's usable and I have no clue of any internals here
but as a developer for other projects I can say that what I want from an api right now is just something that outputs information from e.g.
jj diff
,jj st
,jj log
and allows to use basic commandsthis is required to be able to write integrations which are more or less future proof (language independent once that gets implemented) and you don't need to parse ugly text