Cross Modal Diffusion Model

Barrett Burnworth

☝️ What's up?

Generate Imagery/Video/Audio all at once, in one model… CWT? 😢


https://codi-gen.github.io/static/images/teaser.mp4

https://codi-gen.github.io/

https://www.marktechpost.com/2023/06/25/meet-codi-a-novel-cross-modal-diffusion-model-for-any-to-any-synthesis/

https://arxiv.org/abs/2305.11846

ShortGPT

https://github.com/RayVentura/ShortGPT

📝 Introduction to ShortGPT

ShortGPT is a powerful framework for automating content creation. It simplifies video creation, footage sourcing, voiceover synthesis, and editing tasks.

  • 🎞️ Automated editing framework: Streamlines the video creation process with an LLM-oriented video editing language.

  • 📃 Scripts and Prompts: Provides ready-to-use scripts and prompts for various LLM automated editing processes.

  • 🗣️ Voiceover / Content Creation: Supports multiple languages including English 🇺🇸, Spanish 🇪🇸, Arabic 🇦🇪, French 🇫🇷, Polish 🇵🇱, German 🇩🇪, Italian 🇮🇹, and Portuguese 🇵🇹.

  • 🔗 Caption Generation: Automates the generation of video captions.

  • 🌐🎥 Asset Sourcing: Sources images and video footage from the internet, connecting with the web and Pexels API as necessary.

  • 🧠 Memory and persistency: Ensures long-term persistency of automated editing variables with TinyDB.
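To make the last bullet a bit more concrete, here is a rough sketch of what "persisting editing variables" can look like. TinyDB keeps its documents in a JSON file by default, so this sketch uses only Python's standard `json` module as a stand-in. It is not ShortGPT's code and not TinyDB's API: the `EditingState` class, its methods, and the `voice_language` key are all hypothetical names made up for illustration.

```python
import json
import os
import tempfile


class EditingState:
    """Hypothetical JSON-file-backed store for editing variables.

    A stdlib-only stand-in for the kind of persistence TinyDB provides;
    not taken from ShortGPT or TinyDB.
    """

    def __init__(self, path):
        self.path = path

    def _load(self):
        # A missing file is treated as an empty store.
        if not os.path.exists(self.path):
            return {}
        with open(self.path, "r", encoding="utf-8") as f:
            return json.load(f)

    def set(self, key, value):
        data = self._load()
        data[key] = value
        # Write to a temporary file and rename it over the real one,
        # so an interrupted write does not leave a half-written file.
        tmp = self.path + ".tmp"
        with open(tmp, "w", encoding="utf-8") as f:
            json.dump(data, f)
        os.replace(tmp, self.path)

    def get(self, key, default=None):
        return self._load().get(key, default)


# Demo: save a variable, then read it back through a fresh object,
# which simulates a later run of the program.
demo_path = os.path.join(tempfile.mkdtemp(), "state.json")
EditingState(demo_path).set("voice_language", "es")
print(EditingState(demo_path).get("voice_language"))  # es
```

The point is simply that the state lives on disk rather than in memory, so an editing pipeline can pick up where it left off between runs.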