
feat(ai): add pipelines optimization flags #3013

Merged — 3 commits into ai-video on Apr 16, 2024
Conversation

rickstaa (Contributor) commented Apr 15, 2024

Warning

Do not merge before livepeer/ai-worker#61.

What does this pull request do? Explain your changes. (required)

This commit adds a new `OptimizationFlags` field to the `aiModels` config so that users can forward optimization environment variables to the ai-worker. For more information, see livepeer/ai-worker#61.

Specific updates (required)

  • New `OptimizationFlags` field added to the `aiModels` config.
  • Flags forwarded to the ai-worker `Warm` function.
  • A warning is logged when users set `optimization_flags` for containers that are not warm.
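The forwarding and warning behavior described above can be sketched as follows. This is a hypothetical illustration, not the actual go-livepeer or ai-worker code; `toEnvVars` and the inline warning are stand-ins for what `Warm` receives:

```go
package main

import (
	"fmt"
	"sort"
)

// toEnvVars converts an optimization_flags map into KEY=value strings that
// could be forwarded to a runner container as environment variables.
// Hypothetical sketch; the real forwarding happens inside the ai-worker
// Warm function.
func toEnvVars(flags map[string]interface{}) []string {
	envs := make([]string, 0, len(flags))
	for k, v := range flags {
		envs = append(envs, fmt.Sprintf("%s=%v", k, v))
	}
	sort.Strings(envs) // deterministic order for logging
	return envs
}

func main() {
	flags := map[string]interface{}{"SFAST": "true", "SOME": true}
	warm := false

	// Warn when flags are set on a container that is not started warm,
	// mirroring the warning added in this PR.
	if len(flags) > 0 && !warm {
		fmt.Println("warning: optimization flags are only supported for warm containers")
	}
	fmt.Println(toEnvVars(flags)) // [SFAST=true SOME=true]
}
```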

How did you test each of these updates (required)

  1. Ensured the package builds.
  2. Ensured `tests.sh` ran successfully.
  3. Spun up an off-chain broadcaster.
  4. Spun up an off-chain orchestrator with the SFAST optimization flag in the models configuration file.
[
    {
        "pipeline": "image-to-video",
        "model_id": "stabilityai/stable-video-diffusion-img2vid-xt-1-1",
        "price_per_unit": 3390842,
        "warm": true,
        "optimization_flags": {
            "SFAST": "true",
            "SOME": true
        }
    }
]
  5. Let the ImageToVideo container warm up and checked the container logs to confirm that SFAST is used and the model is pre-traced.
  6. Sent an ImageToVideo request to the broadcaster and checked that it still worked.

Does this pull request close any open issues?

No.

Checklist:

This commit adds a new `OptimizationFlags` field to the `aiModels`
config so that users can forward optimization environment variables to
the [ai-worker](https://github.com/livepeer/ai-worker). For more
information, see livepeer/ai-worker#61.
@github-actions github-actions bot added the AI Issues and PR related to the AI-video branch. label Apr 15, 2024
rickstaa (Contributor, Author) commented Apr 15, 2024

@yondonfu, is there a specific reason why the binary execution proceeds even when an AIModelConfig is not specified? I encountered this issue while trying to activate flags for containers that don't start warm, as I wanted to pass Optimization flags directly to the AIWorker constructor.

if *cfg.AIModels != "" {
	configs, err := core.ParseAIModelConfigs(*cfg.AIModels)
	if err != nil {
		glog.Errorf("Error parsing -aiModels: %v", err)
		return
	}
	for _, config := range configs {
		modelConstraint := &core.ModelConstraint{Warm: config.Warm}
		// If the config contains a URL we call Warm() anyway because AIWorker will just register
		// the endpoint for an external container
		if config.Warm || config.URL != "" {
			endpoint := worker.RunnerEndpoint{URL: config.URL, Token: config.Token}
			if err := n.AIWorker.Warm(ctx, config.Pipeline, config.ModelID, endpoint); err != nil {
				glog.Errorf("Error AI worker warming %v container: %v", config.Pipeline, err)
				return
			}
		}
		switch config.Pipeline {
		case "text-to-image":
			_, ok := constraints[core.Capability_TextToImage]
			if !ok {
				aiCaps = append(aiCaps, core.Capability_TextToImage)
				constraints[core.Capability_TextToImage] = &core.Constraints{
					Models: make(map[string]*core.ModelConstraint),
				}
			}
			constraints[core.Capability_TextToImage].Models[config.ModelID] = modelConstraint
			n.SetBasePriceForCap("default", core.Capability_TextToImage, config.ModelID, big.NewRat(config.PricePerUnit, config.PixelsPerUnit))
		case "image-to-image":
			_, ok := constraints[core.Capability_ImageToImage]
			if !ok {
				aiCaps = append(aiCaps, core.Capability_ImageToImage)
				constraints[core.Capability_ImageToImage] = &core.Constraints{
					Models: make(map[string]*core.ModelConstraint),
				}
			}
			constraints[core.Capability_ImageToImage].Models[config.ModelID] = modelConstraint
			n.SetBasePriceForCap("default", core.Capability_ImageToImage, config.ModelID, big.NewRat(config.PricePerUnit, config.PixelsPerUnit))
		case "image-to-video":
			_, ok := constraints[core.Capability_ImageToVideo]
			if !ok {
				aiCaps = append(aiCaps, core.Capability_ImageToVideo)
				constraints[core.Capability_ImageToVideo] = &core.Constraints{
					Models: make(map[string]*core.ModelConstraint),
				}
			}
			constraints[core.Capability_ImageToVideo].Models[config.ModelID] = modelConstraint
			n.SetBasePriceForCap("default", core.Capability_ImageToVideo, config.ModelID, big.NewRat(config.PricePerUnit, config.PixelsPerUnit))
		}
	}
}
defer func() {
	ctx, cancel := context.WithTimeout(context.Background(), aiWorkerContainerStopTimeout)
	defer cancel()
	if err := n.AIWorker.Stop(ctx); err != nil {
		glog.Errorf("Error stopping AI worker containers: %v", err)
		return
	}
	glog.Infof("Stopped AI worker containers")
}()
}

@rickstaa rickstaa requested a review from yondonfu April 15, 2024 15:02
rickstaa (Contributor, Author) commented
Requires livepeer/ai-worker#61.

yondonfu (Member) commented
@rickstaa

is there a specific reason why the binary execution proceeds even when an AIModelConfig is not specified?

I think it would make sense to require a config file for -aiModels now, because the config file is not only used to warm models, but also for specifying prices and determining which models to advertise during capability discovery. Prior to using the config file for configuring prices and capability discovery, the config file was left as optional because the O would just try to execute any request given a model ID. But now, since models have prices and have to be explicitly advertised, the config file is necessary.
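The stricter startup behavior suggested here could look roughly like the following. `requireAIModels` is a hypothetical helper for illustration, not an existing function in go-livepeer:

```go
package main

import (
	"errors"
	"fmt"
)

// requireAIModels fails fast when no -aiModels config file is given, since
// the file now also drives pricing and capability advertisement.
// Hypothetical sketch of the behavior discussed above.
func requireAIModels(aiModels string) error {
	if aiModels == "" {
		return errors.New("-aiModels config file is required to set prices and advertise models")
	}
	return nil
}

func main() {
	if err := requireAIModels(""); err != nil {
		fmt.Println("startup error:", err)
	}
}
```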

yondonfu (Member) left a comment

Noting for the thread that this will need a go mod update to use the latest version of ai-worker once it is merged.

Review comments on cmd/livepeer/starter/starter.go (outdated, resolved)
This commit ensures that the https://github.com/livepeer/ai-worker
dependency is on the latest commit that includes the new optimization
flags feature (see livepeer/ai-worker#61).
rickstaa (Contributor, Author) commented

@rickstaa

is there a specific reason why the binary execution proceeds even when an AIModelConfig is not specified?

I think it would make sense to require a config file for -aiModels now, because the config file is not only used to warm models, but also for specifying prices and determining which models to advertise during capability discovery. Prior to using the config file for configuring prices and capability discovery, the config file was left as optional because the O would just try to execute any request given a model ID. But now, since models have prices and have to be explicitly advertised, the config file is necessary.

Ah, thanks for the explanation. Makes sense. I will change the behavior!

This commit ensures that the `optimization flag not supported` warning
is shown for each model that is not loaded warm.
@rickstaa rickstaa merged commit 9502ea0 into ai-video Apr 16, 2024
8 of 9 checks passed
@rickstaa rickstaa deleted the add_optimization_flags branch April 16, 2024 14:19
rickstaa added a commit that referenced this pull request Apr 16, 2024
This commit adds a new section explaining the new `optimization_flags` that were enabled
in #3013.
@rickstaa rickstaa mentioned this pull request Jun 10, 2024