Rendering is a required, and sometimes painful, part of a designer's workflow. A quick search for the words AI and render turns up a set of solutions already addressing this process with the help of AI models. In this project, we share one option for generating a photorealistic scene, using as input an image captured from a scene rendered with the APS Viewer.
Read-only demo at https://aps-viewer-ai-render.autodesk.io
This sample leverages the Hubs Browser tutorial as a base to select designs and render them with the Viewer.
From there, the integration with generative AI comes in.
To make this possible, we take advantage of Stable Diffusion.
We've chosen Comfy UI to help us achieve our goal. We strongly encourage you to take some time exploring this amazing tool for working with Stable Diffusion models. You can find a detailed guide at https://stable-diffusion-art.com/comfyui/
To use Comfy UI in a web app through an API, we used comfy.icu for its simplicity.
To summarize, to run this app, you'll need:
- Node.js (Long Term Support version is recommended)
- APS account and APS app
- Your APS app provisioned in your ACC hub
- Comfy.icu account
- Clone this repository:
git clone https://github.com/autodesk-platform-services/aps-simple-viewer-nodejs
- Go to the project folder:
cd aps-simple-viewer-nodejs
- Install Node.js dependencies:
npm install
- Open the project folder in a code editor of your choice
- Create a .env file in the project folder, and populate it with the snippet below, replacing <client-id> and <client-secret> with your APS Client ID and Client Secret:
APS_CLIENT_ID="<client-id>"
APS_CLIENT_SECRET="<client-secret>"
APS_CALLBACK_URL="http://localhost:8080/api/auth/callback" # URL your users will be redirected to after logging in with their Autodesk account
COMFY_WORKFLOW_ID="" # ID of your comfy.icu workflow
COMFY_KEY="" # your comfy.icu API key
- Run the application, either from your code editor, or by running npm start in a terminal
- Open http://localhost:8080
When using Visual Studio Code, you can run & debug the application by pressing F5.
Through Comfy.icu we can easily trigger the same workflow with an HTTP request. We just need to pass a JSON structure with all the nodes we'll use and their connections, like the sample below:
{
"prompt": {
"3": {
"_meta": {
"title": "KSampler"
},
"inputs": {
"cfg": 7,
"seed": 70988865502061,
"model": [
"4",
0
],
"steps": 20,
"denoise": 0.95,
"negative": [
"7",
0
],
"positive": [
"22",
0
],
"scheduler": "karras",
"latent_image": [
"20",
0
],
"sampler_name": "dpmpp_sde"
},
"class_type": "KSampler"
},
"4": {
"_meta": {
"title": "Load Checkpoint"
},
"inputs": {
"ckpt_name": "architecturerealmix_v1repair.safetensors"
},
"class_type": "CheckpointLoaderSimple"
},
"6": {
"_meta": {
"title": "CLIP Text Encode (Prompt)"
},
"inputs": {
"clip": [
"4",
1
],
"text": positiveprompt
},
"class_type": "CLIPTextEncode"
},
"7": {
"_meta": {
"title": "CLIP Text Encode (Prompt)"
},
"inputs": {
"clip": [
"4",
1
],
"text": negativePrompt
},
"class_type": "CLIPTextEncode"
},
"8": {
"_meta": {
"title": "VAE Decode"
},
"inputs": {
"vae": [
"4",
2
],
"samples": [
"3",
0
]
},
"class_type": "VAEDecode"
},
"9": {
"_meta": {
"title": "Save Image"
},
"inputs": {
"images": [
"8",
0
],
"filename_prefix": "ComfyUI"
},
"class_type": "SaveImage"
},
"14": {
"_meta": {
"title": "Preview Image"
},
"inputs": {
"images": [
"29",
0
]
},
"class_type": "PreviewImage"
},
"19": {
"_meta": {
"title": "Load Image"
},
"inputs": {
"image": "image.png",
"upload": "image"
},
"class_type": "LoadImage"
},
"20": {
"_meta": {
"title": "VAE Encode"
},
"inputs": {
"vae": [
"4",
2
],
"pixels": [
"19",
0
]
},
"class_type": "VAEEncode"
},
"22": {
"_meta": {
"title": "Apply ControlNet"
},
"inputs": {
"image": [
"29",
0
],
"strength": 0.8,
"control_net": [
"24",
0
],
"conditioning": [
"6",
0
]
},
"class_type": "ControlNetApply"
},
"24": {
"_meta": {
"title": "Load ControlNet Model"
},
"inputs": {
"control_net_name": "control_v11p_sd15_canny_fp16.safetensors"
},
"class_type": "ControlNetLoader"
},
"29": {
"_meta": {
"title": "Canny Edge"
},
"inputs": {
"image": [
"19",
0
],
"resolution": 1472,
"low_threshold": 80,
"high_threshold": 250
},
"class_type": "CannyEdgePreprocessor"
}
},
"files":{
"/input/image.png":signedDownloadURL
}
}
You can find the JSON for this workflow in the assets folder (controlnet.json).
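As a rough illustration, the snippet below shows how our Node.js server could send this node graph to comfy.icu. The endpoint path and payload shape are assumptions based on comfy.icu's REST API and may differ from the service's current contract; check the comfy.icu documentation before relying on it. The function name runComfyWorkflow is hypothetical.

async function runComfyWorkflow(prompt, signedDownloadURL) {
  // Assumed comfy.icu endpoint: POST a run for the workflow configured in .env.
  const url = `https://comfy.icu/api/v1/workflows/${process.env.COMFY_WORKFLOW_ID}/runs`;
  const response = await fetch(url, {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${process.env.COMFY_KEY}`
    },
    body: JSON.stringify({
      prompt, // the node graph shown above, with the prompt texts filled in
      files: { '/input/image.png': signedDownloadURL } // signed URL to the Viewer screenshot
    })
  });
  if (!response.ok) {
    throw new Error(`comfy.icu request failed: ${response.status}`);
  }
  return response.json(); // run details used to poll for the output image
}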
The Comfy workflow that generates our images requires an input image used as a reference, plus two text prompts (one positive and one negative) that the model uses to compose the output image.
Our sample app implements this as described below:
- The user selects a design from their ACC hub
- With the design rendered, the user configures a specific scene by walking to a room, hiding elements, adding section cuts...
- After clicking the ImageRenderExtension, the process is triggered.
- Through the Viewer's getScreenShot method, we generate the input image and upload it to OSS.
- As soon as this upload is complete, we generate a signed URL for that input image and send that to comfy.icu together with the input texts.
- After the process completes, we download the output image and also upload it to OSS, so that the two images can be displayed in our app for comparison (see the sketch below).
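A minimal sketch of the client-side part of this flow is shown below, assuming the extension holds a reference to the Viewer instance. The /api/render route and the requestAIRender function name are hypothetical placeholders; the sample's actual routes and handlers may differ. The Viewer's getScreenShot(width, height, callback) method is the real API used to capture the scene.

async function requestAIRender(viewer, positivePrompt, negativePrompt) {
  // Capture the current scene as an image using the Viewer's getScreenShot method.
  const blobURL = await new Promise((resolve) =>
    viewer.getScreenShot(1024, 768, resolve)
  );
  const screenshot = await (await fetch(blobURL)).blob();

  // Send the screenshot and prompts to our server, which uploads the image to OSS,
  // creates a signed URL, and forwards everything to comfy.icu.
  const form = new FormData();
  form.append('image', screenshot, 'image.png');
  form.append('positivePrompt', positivePrompt);
  form.append('negativePrompt', negativePrompt);

  const resp = await fetch('/api/render', { method: 'POST', body: form });
  if (!resp.ok) throw new Error(`Render request failed: ${resp.status}`);
  return resp.json(); // e.g. URLs of the input and output images for side-by-side display
}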
This sample is licensed under the terms of the MIT License. Please see the LICENSE file for full details.
Joao Martins in/jpornelas, Developer Advocate