[WIP] AI video prototype #2959

Draft · wants to merge 229 commits into master from ai-video
Conversation

yondonfu (Member) commented Jan 23, 2024

Opening this draft PR to kick off CI build process for the WIP ai-video branch. This PR should not be merged as-is and the code on this branch will likely be refactored + cleaned up separately later on.

yondonfu changed the title from [WIP] AI Video prototype to [WIP] AI video prototype on Jan 23, 2024
yondonfu force-pushed the ai-video branch 5 times, most recently from feb3f3e to 563d199 on January 24, 2024
yondonfu (Member, Author) commented Jan 24, 2024

d32b1b5 temporarily disables Linux arm64 builds because they fail with an error about zlib not being found during ffmpeg compilation. This error doesn't occur for amd64 builds, so I suspect the issue is related to the amd64 -> arm64 cross-compilation process. I noticed that we compile an arm64-specific version of x264 before compiling ffmpeg - perhaps we need to do something similar with zlib?

zlib is currently required as a dependency as of 133050d#diff-4ae778054809274731b9da0c6a5a869c0bd214e92f954a5c9c39181748c2f175, which enabled the png decoder and image2 demuxer used to demux and decode a sequence of PNG files so they can be encoded into an mp4 file. Ideally, we would replace the PNG demux/decode component by passing the tensors (representing frames) output by a model directly from GPU memory to NVENC using torchaudio.StreamWriter. However, torchaudio.StreamWriter doesn't support RGB -> YUV conversion on the GPU yet. It can still encode a larger, less streaming-friendly RGB output (my understanding is that yuv420p is preferred for streaming), but I didn't implement that yet because of this limitation. Until this replacement happens, zlib remains a required dependency for the temporary PNG demux + decode component.
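As a rough illustration of the StreamWriter path described above, here is a minimal sketch of encoding model-output RGB frames to mp4 with torchaudio and NVENC. It assumes torchaudio 2.x bound to an FFmpeg build that includes h264_nvenc, and the frame shapes and frame rate are illustrative. Because of the GPU RGB -> YUV limitation mentioned above, this sketch hands frames over in host memory and lets the CPU convert to yuv420p, rather than passing tensors directly from GPU memory:

```python
import torch
from torchaudio.io import StreamWriter

# Hypothetical model output: RGB frames as uint8 tensors in (time, channel,
# height, width) layout. Shape and frame rate are illustrative, not taken
# from the PR.
frames = torch.randint(0, 256, (30, 3, 512, 512), dtype=torch.uint8)

writer = StreamWriter("out.mp4")
writer.add_video_stream(
    frame_rate=30,
    width=512,
    height=512,
    format="rgb24",            # input chunks are packed RGB
    encoder="h264_nvenc",      # NVENC hardware encoder; requires an NVENC-enabled FFmpeg build
    encoder_format="yuv420p",  # streaming-friendly pixel format mentioned in the comment
)

with writer.open():
    writer.write_video_chunk(0, frames)
```

This is only a sketch of the interim, host-memory variant; the GPU-direct handoff the comment describes would additionally need the RGB -> YUV conversion to happen on the GPU, which was not supported at the time.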

iameli (Contributor) commented Jan 24, 2024

@yondonfu Weird - on release go-livepeer right now zlib is dynamically linked. I'll have a look.

rickstaa and others added 30 commits August 20, 2024 14:29
This commit ensures the ai network software is in sync with the main
branch.
This commit enables the macos and linux-amd64 builds.
This commit removes some redundant code in the build action.
This commit makes a small improvement to the doc comments in the build
action. This was done to align the code with
https://github.com/livepeer/go-livepeer/pull/3148/files.
This commit released v0.7.7-ai.3 of the AI subnet software.
This commit (#3116) allows Gateways to specify the maximum price they are willing to pay for a given capability and model combination. Gateways use a single MaxPrice, set at launch with the maxPricePerUnit flag, which serves as the default. Additionally, they can supply a JSON config via the maxPricePerCapability flag to set prices per capability and model ID; a hedged example of such a config is sketched after this commit list. The maximum price per capability can also be adjusted via the setBroadcastConfig endpoint of the CLI webserver.
This commit released v0.7.7-ai.3 of the AI subnet software.
This commit released v0.7.8-ai.1 of the AI subnet software.
This commit applies a small code improvement.
This commit removes a redundant command which I introduced in the last
commit.
This commit adds support for the new [Segment Anything 2](https://ai.meta.com/sam2/) pipeline (SAM2) that was added to the AI-worker in [this pull request](livepeer/ai-worker#185). While the new SAM2 pipeline can also perform video segmentation, that will be added in a subsequent pull request.

Co-authored-by: John | Elite Encoder <[email protected]>
Co-authored-by: Peter Schroedl <[email protected]>
Co-authored-by: Rick Staa <[email protected]>
This commit released the new AI network software.
This commit updates the ai-worker dependency to the version with the changed worker types.
This commit ensures that the go-livepeer code uses the new worker classes that were defined in livepeer/ai-worker#191.
This commit updates the AI error handling behavior so that BadRequest errors are forwarded to the user.

Co-authored-by: Rick Staa <[email protected]>
While merging the main branch into the AI branch, the fragile AI selection
algorithm broke due to changes in the transcoding selection logic, which
the AI algorithm relies on. This commit provides a temporary patch to
ensure the selection process continues to function while we work on
improving the AI selection algorithm.
Add orchestrator_version tag to ai_request_latency_score and ai_request_errors metrics
This commit adds the necessary changes to support the new LLM pipeline introduced in the [ai-worker](https://github.com/livepeer/ai-worker) version [v0.7.0](https://github.com/livepeer/ai-worker/releases/tag/v0.7.0).
* fix: correct the order of the capability list

This commit addresses an issue where the capability list was ordered incorrectly, which was introduced in https://github.com/livepeer/go-livepeer/pull/3114/files. The incorrect ordering broke the SAM2 pipeline, which is now fixed.

* Refactor Capabilities to use const values instead of iota

---------

Co-authored-by: Rafał Leszko <[email protected]>
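
For illustration of the maxPricePerCapability commit referenced above, here is a minimal Python sketch that writes out the kind of JSON config that flag might accept. The field names, pipelines, and prices are assumptions for illustration only and are not taken from this PR:

```python
import json

# Illustrative per-capability max-price config. The field names
# ("capabilities_prices", "pipeline", "model_id", "price_per_unit",
# "pixels_per_unit") and the values are assumptions, not taken from this PR.
max_price_per_capability = {
    "capabilities_prices": [
        {
            "pipeline": "text-to-image",
            "model_id": "stabilityai/sd-turbo",
            "price_per_unit": 1000,
            "pixels_per_unit": 1,
        },
        {
            "pipeline": "image-to-video",
            "model_id": "default",
            "price_per_unit": 5000,
            "pixels_per_unit": 1,
        },
    ]
}

# Write the config to a file so it could be supplied to the gateway via the
# maxPricePerCapability flag; per the commit message, any capability/model
# combination not covered would fall back to the default MaxPrice set with
# maxPricePerUnit.
with open("max_prices.json", "w") as f:
    json.dump(max_price_per_capability, f, indent=2)
```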