-
Notifications
You must be signed in to change notification settings - Fork 11
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Browse files
Browse the repository at this point in the history
* 791 changes * Updates * Updates --------- Co-authored-by: saudsami <[email protected]> Co-authored-by: Saud <[email protected]>
- Loading branch information
1 parent
86905ee
commit 3d97226
Showing
10 changed files
with
324 additions
and
0 deletions.
There are no files selected for viewing
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
12 changes: 12 additions & 0 deletions
12
extensions-marketplace/develop/integrate/dubbing-voice-changer.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
--- | ||
title: "Dabi AI Dubbing Voice Changer" | ||
sidebar_position: 3 | ||
type: docs | ||
description: > | ||
Dabi AI voice changing plug-in is an encapsulation of the core API of real-time AI voice conversion. | ||
--- | ||
export const toc = [{}]; | ||
|
||
import DubbingVoiceChanger from '@docs/shared/extensions-marketplace/dubbing-voice-changer/index.mdx'; | ||
|
||
<DubbingVoiceChanger /> |
56 changes: 56 additions & 0 deletions
56
shared/extensions-marketplace/dubbing-voice-changer/index.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,56 @@ | ||
import ProjectImplementation from '@docs/shared/extensions-marketplace/dubbing-voice-changer/project-implementation/index.mdx'; | ||
import Reference from '@docs/shared/extensions-marketplace/dubbing-voice-changer/reference/index.mdx'; | ||
|
||
<Vg k="RTEE_DUB" /> extension enables you to change the audio timbre. | ||
|
||
<PlatformWrapper notAllowed="android, ios"> | ||
**This extension is not yet available for <Vpl k="NAME" />.** | ||
</PlatformWrapper> | ||
|
||
<PlatformWrapper notAllowed="web, macos, windows, flutter, react-native"> | ||
This page shows you how to integrate the <Vg k="RTEE_DUB" /> extension in your app. | ||
|
||
## Understand the tech | ||
|
||
<PlatformWrapper platform="android"> | ||
Using the <Link to="{{Global.API_REF_ANDROID_ROOT}}/class_irtcengine.html#api_irtcengine_setextensionproperty">setExtensionProperty</Link> method provided in <Vg k="COMPANY" /> SDK v<Vg k="VSDK_RELEASE_API" /> and passing in the specified `key` and `value` parameters, you can quickly integrate real-time AI voice conversion ability into your <Vpl k="CLIENT" />. | ||
|
||
The `key` parameter you specify in the `setExtensionProperty` method corresponds to the name of the API, and the `value` parameter wraps some or all of the parameters of the API in JSON format. By passing in the specified `key` and `value` parameters, you can call the corresponding API to realize the functions related to real-time AI sound conversion. | ||
</PlatformWrapper> | ||
|
||
<PlatformWrapper platform="ios"> | ||
Using the <Link to="{{Global.API_REF_IOS_ROOT}}/class_irtcengine.html#api_irtcengine_setextensionproperty">setExtensionPropertyWithVendor</Link> method provided in <Vg k="COMPANY" /> SDK v<Vg k="VSDK_RELEASE_API" /> and passing in the specified `key` and `value` parameters, you can quickly integrate real-time AI voice conversion ability into your <Vpl k="CLIENT" />. | ||
|
||
The `key` parameter you specify in the `setExtensionPropertyWithVendor` method corresponds to the name of the API, and the `value` parameter wraps some or all of the parameters of the API in JSON format. By passing in the specified `key` and `value` parameters, you can call the corresponding API to realize the functions related to real-time AI sound conversion. | ||
</PlatformWrapper> | ||
|
||
## Prerequisites | ||
|
||
Ensure that your development environment meets the following requirements: | ||
|
||
<PlatformWrapper platform="android"> | ||
- Android Studio 4.1 or later. | ||
- A physical device (not an emulator) running Android 5.0 or later. | ||
</PlatformWrapper> | ||
|
||
<PlatformWrapper platform="ios"> | ||
- Xcode 9.0 or later. | ||
- A physical device (not an emulator) running iOS 8.0 or later. | ||
</PlatformWrapper> | ||
- <Vg k="RTEE_DUB" /> is used with <Vg k="COMPANY" /> <Vg k="ASDK" /> v<Vg k="VSDK_RELEASE_API" />. | ||
|
||
Refer to the [SDK quickstart](/voice-calling/get-started/get-started-sdk) to integrate <Vg k="ASDK" /> v<Vg k="VSDK_RELEASE_API" /> and implement basic voice calling. | ||
|
||
## Integrate the extension | ||
|
||
This section shows you how to integrate the <Vg k="RTEE_DUB" /> extension, and call the core API to implement the voice changing function. | ||
|
||
<ProjectImplementation /> | ||
|
||
## Reference | ||
|
||
This section contains information that completes the information in this page, or points you to documentation that explains other aspects to this product. | ||
|
||
<Reference /> | ||
|
||
</PlatformWrapper> |
118 changes: 118 additions & 0 deletions
118
...extensions-marketplace/dubbing-voice-changer/project-implementation/android.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,118 @@ | ||
<PlatformWrapper platform="android"> | ||
|
||
1. **Integrate the <Vg k="RTEE_DUB" /> extension** | ||
|
||
To integrate the extension in your project: | ||
|
||
1. Enter the [Agora console > Cloud Market](https://console.agora.io/marketplace/list/all) page and download the <Vpl k="NAME" /> **<Vg k="RTEE_DUB" />** extension package. After unzipping, save all `.aar` files to the project folder `/app/libs`. | ||
1. Get the following resource files and save them in the same directory of the project folder. For example, a new `vc_model` directory: | ||
- License file and tone files: Contact <Vg k="COMPANY" /> to obtain these files. The suffix of the tone file is `.dat`, and the tone file is issued according to the license. | ||
- Model file: Download the required resources. | ||
![Resource Files](/images/extensions-marketplace/resource-files.png) | ||
1. Open the `app/build.gradle` file and add the following lines under `dependencies`: | ||
|
||
```java | ||
implementation fileTree(dir: "libs", include: ["*.jar", "*.aar"]) | ||
``` | ||
|
||
1. **Enable the plugin** | ||
|
||
After creating and initializing `RtcEngine`, call `enableExtension` to enable the plug-in, before you call other APIs such as `enableVideo` and `joinChannel`. | ||
|
||
```kotlin | ||
// Declare parameters | ||
private val EXTENSION_NAME = "dubbing_vc" | ||
private val EXTENSION_VENDOR_NAME = "Dubbing" | ||
private val EXTENSION_AUDIO_FILTER = "DubbingVC" | ||
|
||
private val changeSpeaker_ = "changeSpeaker" | ||
private val startRealTimeTranscribe_ = "startRealTimeTranscribe" | ||
private val stopRealTimeTranscribe_ = "stopRealTimeTranscribe" | ||
private val getSpeakersInfo_ = "getSpeakersInfo" | ||
|
||
val config = RtcEngineConfig() | ||
config.mContext = baseContext | ||
config.mAppId = appId | ||
config.mEventHandler = mRtcEventHandler | ||
// Load plugin | ||
config.addExtension(EXTENSION_NAME) | ||
config.mExtensionObserver = extensionObserver | ||
// Create and initialize RtcEngine | ||
mRtcEngine = RtcEngine.create(config) | ||
mRtcEngine.enableAudio() | ||
// Enable plugin | ||
mRtcEngine.enableExtension(EXTENSION_VENDOR_NAME, EXTENSION_AUDIO_FILTER, enable) | ||
``` | ||
|
||
1. **Set the resource file path** | ||
|
||
When you download and integrate the plug-in, the license, sound, and model files are saved to the specified directory. In this step, you specify the path where these resource files are located. | ||
|
||
```kotlin | ||
val modelPath: String = "${context.filesDir}${File.separator}vc_model" | ||
``` | ||
|
||
1. **Get the tone list** | ||
|
||
After receiving the `onExtensionStarted` callback from <Vg k="COMPANY" /> SDK, call `getExtensionProperty` passing in the `getSpeakersInfo` key to get the timbre list: | ||
|
||
```kotlin | ||
val speakerList = mRtcEngine.getExtensionProperty( | ||
EXTENSION_VENDOR_NAME, | ||
EXTENSION_AUDIO_FILTER, | ||
getSpeakersInfo_ | ||
) | ||
// Convert JSON to array | ||
val arr = JSONArray(speakerList) | ||
``` | ||
|
||
The timbre list is returned as JSON data that you parse yourself. | ||
|
||
1. **Start changing** | ||
|
||
Call `setExtensionProperty` and pass in the corresponding `key` and `value`: | ||
|
||
```kotlin | ||
mRtcEngine.setExtensionProperty( | ||
EXTENSION_VENDOR_NAME, | ||
EXTENSION_AUDIO_FILTER, | ||
startRealTimeTranscribe_, | ||
"true" | ||
) | ||
``` | ||
|
||
1. **Select timbre** | ||
|
||
Pass in the timbre ID from the obtained timbre list to set the corresponding timbre: | ||
|
||
```kotlin | ||
mRtcEngine.setExtensionProperty( | ||
EXTENSION_VENDOR_NAME, | ||
EXTENSION_AUDIO_FILTER, | ||
changeSpeaker_, | ||
id | ||
) | ||
``` | ||
|
||
1. **Stop changing** | ||
|
||
Call the API to stop the voice changer: | ||
|
||
```kotlin | ||
mRtcEngine.setExtensionProperty( | ||
EXTENSION_VENDOR_NAME, | ||
EXTENSION_AUDIO_FILTER, | ||
stopRealTimeTranscribe_, | ||
"true" | ||
) | ||
``` | ||
|
||
1. **Release resources** | ||
|
||
Close the plug-in and release the resources used by the plug-in: | ||
|
||
```kotlin | ||
mRtcEngine.enableExtension(EXTENSION_VENDOR_NAME, EXTENSION_AUDIO_FILTER, false) | ||
``` | ||
|
||
</PlatformWrapper> |
5 changes: 5 additions & 0 deletions
5
...d/extensions-marketplace/dubbing-voice-changer/project-implementation/index.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
import Android from './android.mdx'; | ||
import Ios from './ios.mdx'; | ||
|
||
<Android /> | ||
<Ios /> |
85 changes: 85 additions & 0 deletions
85
shared/extensions-marketplace/dubbing-voice-changer/project-implementation/ios.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,85 @@ | ||
<PlatformWrapper platform="ios"> | ||
|
||
1. **Integrate the <Vg k="RTEE_DUB" /> extension** | ||
|
||
To integrate the extension in your project: | ||
|
||
1. Enter the [Agora console > Cloud Market](https://console.agora.io/marketplace/list/all) page and download the <Vpl k="NAME" /> plug-in package for **<Vg k="RTEE" />** plug-in. | ||
1. Unzip the folder, import all `.framework` files into the project, and modify `Embed` to `Embed & Sign`. | ||
1. Get the following resource files and save them in the same directory of the project folder (For example, Resource path): | ||
- License file and tone files: Contact <Vg k="COMPANY" /> to obtain these files. The suffix of the tone file is named `.dat`, and the tone file is issued according to the license. | ||
- Model file: Download the required resources. | ||
![Resource Files](/images/extensions-marketplace/resource-files.png) | ||
1. Import the `libc++` system library and third-party dynamic libraries into the project `MJExtension`. | ||
|
||
1. **Enable the plugin** | ||
|
||
After creating and initializing `RtcEngine`, call `enableExtensionWithVendor` to enable the plug-in, before you call other APIs such as `enableVideo`, and `joinChannelByToken` | ||
|
||
```objective-c | ||
AgoraRtcEngineConfig *cfg = [[AgoraRtcEngineConfig alloc] init]; | ||
cfg.appId = appId; | ||
cfg.eventDelegate = self; | ||
cfg.channelProfile = AgoraChannelProfileLiveBroadcasting; | ||
cfg.audioScenario = AgoraAudioScenarioGameStreaming; | ||
self.agoraKit = [AgoraRtcEngineKit sharedEngineWithConfig:cfg delegate:self]; | ||
[self.agoraKit setClientRole:AgoraClientRoleBroadcaster]; | ||
|
||
[self.agoraKit enableExtensionWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" enabled:YES]; | ||
``` | ||
|
||
1. **Set the resource file path** | ||
|
||
When you download and integrate the plug-in, the license, sound, and model files are saved to the specified directory. In this step, you specify the path where these resource files are located. | ||
|
||
```objective-c | ||
NSString *sourcePath = [[NSBundle mainBundle] resourcePath]; | ||
[self.agoraKit setExtensionPropertyWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" key:@"setResourcesFilePath" value:sourcePath]; | ||
``` | ||
|
||
1. **Get the tone list** | ||
|
||
After receiving the `onExtensionStarted` callback from <Vg k="COMPANY" /> SDK, call `getExtensionPropertyWithVendor` passing in the `getSpeakersInfo` key to get the timbre list: | ||
|
||
```objective-c | ||
NSString *voices = [self.agoraKit getExtensionPropertyWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" key:@"getSpeakersInfo"]; | ||
// Convert JSON to array | ||
NSData *nsData = [voices dataUsingEncoding:NSUTF8StringEncoding]; | ||
_speakerArray = [NSJSONSerialization JSONObjectWithData:nsData options:kNilOptions error:nil]; | ||
``` | ||
|
||
The timbre list is returned as JSON data that you parse yourself. | ||
|
||
1. **Start changing** | ||
|
||
Call `setExtensionPropertyWithVendor` and pass in the corresponding `key` and `value`: | ||
|
||
```objective-c | ||
[self.agoraKit setExtensionPropertyWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" key:@"startRealTimeTranscribe" value:@"startRealTimeTranscribe"]; | ||
``` | ||
|
||
1. **Select timbre** | ||
|
||
Pass in the timbre ID from the obtained timbre list to set the corresponding timbre: | ||
|
||
```objective-c | ||
[self.agoraKit setExtensionPropertyWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" key:@"changeSpeaker" value:@“”]; | ||
``` | ||
|
||
1. **Stop changing** | ||
|
||
Call the API to stop the voice changer: | ||
|
||
```objective-c | ||
[self.agoraKit setExtensionPropertyWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" key:@"stopRealTimeTranscribe" value:@"stopRealTimeTranscribe"]; | ||
``` | ||
|
||
1. **Release resources** | ||
|
||
Close the plug-in and release the resources used by the plug-in: | ||
|
||
```objective-c | ||
[self.agoraKit enableExtensionWithVendor:[EngineFilterManager vendorName] extension:@"RealTimeTranscribe" enabled:NO]; | ||
``` | ||
|
||
</PlatformWrapper> |
21 changes: 21 additions & 0 deletions
21
shared/extensions-marketplace/dubbing-voice-changer/reference/android.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
<PlatformWrapper platform="android"> | ||
|
||
### Sample project | ||
|
||
The <Vg k="RTEE_DUB" /> extension provides a [GitHub sample project](https://github.com/Dubbing-AI-Voice-Changer/DubbingAgoraDemo/tree/main/Android) for testing. To clone and run the project: | ||
|
||
1. Clone the repository | ||
|
||
```sh | ||
git clone https://github.com/Dubbing-AI-Voice-Changer/DubbingAgoraDemo.git | ||
``` | ||
|
||
2. Refer to the `README.md` file in the repository to run the demo. | ||
|
||
### API reference | ||
|
||
- <Link to="{{Global.API_REF_ANDROID_ROOT}}/rtc_api_data_type.html#api_irtcengine_addextension">addExtension</Link> in the `RtcEngineConfig` class | ||
- <Link to="{{Global.API_REF_ANDROID_ROOT}}/class_irtcengine.html#api_irtcengine_enableextension">enableExtension</Link> in the `RtcEngine` class | ||
- <Link to="{{Global.API_REF_ANDROID_ROOT}}/class_irtcengine.html#api_irtcengine_setextensionproperty">setExtensionProperty</Link> in the `RtcEngine` class | ||
|
||
</PlatformWrapper> |
5 changes: 5 additions & 0 deletions
5
shared/extensions-marketplace/dubbing-voice-changer/reference/index.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
import Android from './android.mdx'; | ||
import Ios from './ios.mdx'; | ||
|
||
<Android /> | ||
<Ios /> |
21 changes: 21 additions & 0 deletions
21
shared/extensions-marketplace/dubbing-voice-changer/reference/ios.mdx
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
<PlatformWrapper platform="ios"> | ||
|
||
### Sample project | ||
|
||
The <Vg k="RTEE_DUB" /> extension provides a [GitHub sample project](https://github.com/Dubbing-AI-Voice-Changer/DubbingAgoraDemo/tree/main/iOS) for testing. To clone and run the project: | ||
|
||
1. Clone the repository | ||
|
||
```sh | ||
git clone https://github.com/Dubbing-AI-Voice-Changer/DubbingAgoraDemo.git | ||
``` | ||
|
||
2. Refer to the `README.md` file in the repository to run the demo. | ||
|
||
|
||
### API reference | ||
|
||
- <Link to="{{Global.API_REF_IOS_ROOT}}/class_irtcengine.html#api_irtcengine_enableextension">enableExtensionWithVendor</Link> in the `AgoraRtcEngineKit` class | ||
- <Link to="{{Global.API_REF_IOS_ROOT}}/class_irtcengine.html#api_irtcengine_setextensionproperty">setExtensionPropertyWithVendor</Link> in the `AgoraRtcEngineKit` class | ||
|
||
</PlatformWrapper> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters