Add base moderation to Providers #513

zakiali · 2024-12-20T20:48:15Z

Adding in moderation for user inputs.

A Moderation trait is added to the base Provider struct along
Update the base provider complete method to make a concurrent request to both the moderation and LLM endpoint. If moderation fails, we error out and abort the LLM completion if the request is still in flight. If moderation passes, we wait for the LLM response and continue as normal
Only moderates on the latest User message
Implemented moderation for the OpenAI provider using the openai's moderation endpoint, as an example

ahau-square · 2025-01-07T23:43:58Z

crates/goose/src/providers/openai.rs

+
+        let response_json: serde_json::Value = response.json().await?;
+
+        let flagged = response_json["results"][0]["flagged"]


should we check response status before parsing? I think we use handle_response elsewhere

ahau-square · 2025-01-07T23:48:11Z

crates/goose/src/providers/base.rs

+
+        // Get the content to moderate
+        let content = latest_user_msg.content.first().unwrap().as_text().unwrap();
+        println!("Content to moderate: {}", content);


…tion/completion calls

github-actions · 2025-01-10T01:34:55Z

Desktop App for this PR

The following build is available for testing:

📱 macOS Desktop App (Universal, signed)

The app is signed and notarized for macOS. After downloading, unzip the file and drag the Goose.app to your Applications folder.

This link is provided by nightly.link and will work even if you're not logged into GitHub.

github-actions · 2025-01-10T01:47:30Z

Desktop App for this PR

The following build is available for testing:

📱 macOS Desktop App (Universal, signed)

The app is signed and notarized for macOS. After downloading, unzip the file and drag the Goose.app to your Applications folder.

This link is provided by nightly.link and will work even if you're not logged into GitHub.

* v1.0: feat: static settings page (#570) styles: adding arcade styles and cash sans (#581) feat: quick spinner while loading tokenizers (#573) Add timeout middleware for clients (#572) chore: remove unused old setup for CLI (#574) feat: env and secrets configuration for mcp server (#565) Add Databricks moderation (#540) feat: add pagination support for tools/list and resources/list (#566) Add resource capabilties to MCP servers that use it (#576) Add goose versions to the UI (#526) fix: Set stdin to null in shell/bash tools (#568) Add base moderation to Providers (#513) feat: set process_group(0) on stdio systems to avoid ctrl-c handling (#567) feat: read only active resources in the agent loop (#560)

zakiali force-pushed the zaki/moderation branch from eb54b6b to b2023fc Compare December 20, 2024 22:25

zakiali marked this pull request as ready for review December 20, 2024 22:26

zakiali force-pushed the zaki/moderation branch from b2023fc to cbab83c Compare December 23, 2024 18:17

zakiali force-pushed the zaki/moderation branch 7 times, most recently from 2199bad to e89e10d Compare January 3, 2025 19:00

zakiali requested review from baxen, ahau-square and salman1993 January 6, 2025 19:13

ahau-square approved these changes Jan 7, 2025

View reviewed changes

zakiali force-pushed the zaki/moderation branch 8 times, most recently from 7ded65b to 92fd00a Compare January 10, 2025 01:28

zakiali added 7 commits January 9, 2025 17:28

Add Moderation trait to Provider and rework complete for async modera…

7824a86

…tion/completion calls

Add OpenAI moderation

094b098

format

a5e514f

Fixup error printing

47c84b1

fix tests for Provider

b52e14b

caching moderation results

9edf427

fmt

4cc84de

zakiali force-pushed the zaki/moderation branch from 92fd00a to 924a8ec Compare January 10, 2025 01:28

Add Moderation trait to OpenRouter

d503e1f

zakiali force-pushed the zaki/moderation branch from 924a8ec to d503e1f Compare January 10, 2025 01:40

zakiali merged commit 7b827d0 into v1.0 Jan 10, 2025
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add base moderation to Providers #513

Add base moderation to Providers #513

zakiali commented Dec 20, 2024 •

edited

Loading

ahau-square Jan 7, 2025 •

edited

Loading

ahau-square Jan 7, 2025

github-actions bot commented Jan 10, 2025

github-actions bot commented Jan 10, 2025


		let response_json: serde_json::Value = response.json().await?;

		let flagged = response_json["results"][0]["flagged"]

Add base moderation to Providers #513

Add base moderation to Providers #513

Conversation

zakiali commented Dec 20, 2024 • edited Loading

ahau-square Jan 7, 2025 • edited Loading

Choose a reason for hiding this comment

ahau-square Jan 7, 2025

Choose a reason for hiding this comment

github-actions bot commented Jan 10, 2025

Desktop App for this PR

github-actions bot commented Jan 10, 2025

Desktop App for this PR

zakiali commented Dec 20, 2024 •

edited

Loading

ahau-square Jan 7, 2025 •

edited

Loading