feat: DIA-1450: Image Classification support in adala #264
Conversation
Codecov Report
Attention: Patch coverage is
Additional details and impacted files

@@ Coverage Diff @@
##           master     #264      +/-   ##
==========================================
+ Coverage   66.49%   67.97%   +1.48%
==========================================
  Files          47       47
  Lines        2462     2498      +36
==========================================
+ Hits         1637     1698      +61
+ Misses        825      800      -25
def split_message_into_chunks(
    input_template: str, input_field_types: Dict[str, MessageChunkType], **input_fields
) -> List[MessageChunk]:
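For illustration, a minimal standalone sketch of what such a splitter could do, assuming OpenAI-style content parts. The `"image_url"` type marker and all of the logic here are my assumptions, not the PR's actual implementation:

```python
import re
from typing import Dict, List

# Sketch (not the PR's code): split an input template into typed content
# parts. Fields marked "image_url" become image chunks; everything else
# is interpolated into plain text chunks.
def split_message_into_chunks(
    input_template: str, input_field_types: Dict[str, str], **input_fields
) -> List[Dict]:
    chunks: List[Dict] = []
    last = 0
    for m in re.finditer(r"\{(\w+)\}", input_template):
        name = m.group(1)
        text_before = input_template[last : m.start()]
        if input_field_types.get(name) == "image_url":
            if text_before:
                chunks.append({"type": "text", "text": text_before})
            chunks.append(
                {"type": "image_url", "image_url": {"url": input_fields[name]}}
            )
        else:
            # Text fields are interpolated in place; unknown fields are
            # left as the literal "{name}" placeholder.
            chunks.append(
                {
                    "type": "text",
                    "text": text_before + str(input_fields.get(name, m.group(0))),
                }
            )
        last = m.end()
    if input_template[last:]:
        chunks.append({"type": "text", "text": input_template[last:]})
    return chunks
```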
I'm tempted to ask "why not make the return type Message?" which leads to what I assume is the reason: this is actually one particular type of Message, and that's importantly different from the str version.
Which leads me to: I'm generally uneasy about us having these two different paths, and ideally it'd be nice to just have a "normalize message" function that includes both get_messages and split_message_into_chunks.
Generally, it feels like our pre-processing is getting pretty convoluted, and this layer seems like plenty to push us to "just have one normalize function that defines the permitted inputs, and outputs a single well-understood output type" (which I imagine would be a List of Dicts, each with a "role" and a "content" entry). I'm torn on whether this should just be a pydantic model or dataclass or something, though, because while it seems clearly cleaner, I feel like we're going to have an annoying amount of serialization/deserialization overhead at that point.
Happy to hop on a call to discuss all this
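For what it's worth, a hedged sketch of what that single normalize entry point could look like. The name `normalize_messages` and the exact output shape are my assumptions based on the comment above, not an existing adala API:

```python
from typing import Dict, List, Union

Message = Dict[str, Union[str, list]]

# Hypothetical sketch: accept either a bare string prompt or an
# already-structured message list, and always return a list of
# {"role": ..., "content": ...} dicts.
def normalize_messages(messages: Union[str, List[Message]]) -> List[Message]:
    if isinstance(messages, str):
        return [{"role": "user", "content": messages}]
    normalized: List[Message] = []
    for msg in messages:
        if "role" not in msg or "content" not in msg:
            raise ValueError(f"Malformed message: {msg!r}")
        normalized.append({"role": msg["role"], "content": msg["content"]})
    return normalized
```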
Yep, exactly, the multimodal vs text-only input paths still seem importantly different enough to me. I don't want to coerce them all into the same pipeline yet without better understanding any more special handling we'll have to do, nor do I want to port existing text-only inputs from str to {'type': 'text', 'text': str} for no reason other than clean code. Definitely feeling the lack of well-defined data models here, but I'm willing to let this be an "exploration" phase which we follow with a "consolidation" phase.
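To make the two shapes in question concrete, here are the two content forms side by side. These follow the common chat-completions content format (a plain str for text-only vs. a list of typed parts for multimodal); the example values are illustrative:

```python
# Text-only path: content is a bare string.
text_only = {"role": "user", "content": "Describe the weather"}

# Multimodal path: content is a list of typed parts, so text must be
# wrapped as {'type': 'text', 'text': str} alongside image parts.
multimodal = {
    "role": "user",
    "content": [
        {"type": "text", "text": "Describe this image:"},
        {"type": "image_url", "image_url": {"url": "https://example.com/sky.png"}},
    ],
}
```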
adala/runtimes/_litellm.py
current_chunk: Optional[MessageChunk] = None

def add_to_current_chunk(chunk):
    nonlocal current_chunk
I'm guessing you've tried already, but would definitely like to avoid all this nonlocal business... I'd think we could accomplish the push_current_chunk behavior by using a generator function and iterating through that? And then for add_to_current_chunk we could accumulate into a var within the generator function, and empty it after yielding?
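A rough sketch of that generator idea, assuming chunks are dicts with a "type" key as in the OpenAI content-part format. The names here are hypothetical, not the PR's code:

```python
from typing import Dict, Iterator, List

# Sketch: accumulate consecutive text parts into one buffer and yield a
# finished text chunk whenever a non-text part (e.g. an image) interrupts.
# No nonlocal mutation needed; state lives inside the generator.
def iter_chunks(parts: List[Dict]) -> Iterator[Dict]:
    buffer = ""
    for part in parts:
        if part["type"] == "text":
            buffer += part["text"]
        else:
            if buffer:
                yield {"type": "text", "text": buffer}
                buffer = ""
            yield part
    if buffer:
        yield {"type": "text", "text": buffer}
```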
Or for add_to_current_chunk we could just return the new chunk. So like:

def add_to_chunk(current_chunk, additional_chunk):
    if current_chunk:
        current_chunk["text"] += additional_chunk["text"]
    else:
        current_chunk = additional_chunk
    return current_chunk

Then we don't have to reason about nonlocal state in the loop.
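Plugged into a loop, the returned-chunk style might read like this. This is a self-contained sketch with hypothetical names; the surrounding merge loop is my assumption about how the helper would be used:

```python
# Merge consecutive text chunks; start a new chunk otherwise.
# Note this mutates current_chunk in place when merging.
def add_to_chunk(current_chunk, additional_chunk):
    if current_chunk:
        current_chunk["text"] += additional_chunk["text"]
    else:
        current_chunk = additional_chunk
    return current_chunk

# Hypothetical driver loop: collapse runs of text parts into single
# chunks while passing non-text parts (e.g. images) through untouched.
def merge_text_parts(parts):
    chunks, current = [], None
    for part in parts:
        if part["type"] == "text":
            current = add_to_chunk(current, part)
        else:
            if current:
                chunks.append(current)
                current = None
            chunks.append(part)
    if current:
        chunks.append(current)
    return chunks
```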
I can just paste the function bodies 2-3x each; I just wanted to make the intent of the code clearer by giving them descriptive names. There's no algorithmic reason it has to be like this :)
- [x] parse input variables to split message based on media type
- [x] plumb up vision runtime
- [x] generalize error handling
- [x] let LabelStudioSkill set input variable types
- [x] add tests