Cross Modal Diffusion Model
Barrett Burnworth
What's up?
Generate Imagery/Video/Audio all at once, in one model… CWT?
https://codi-gen.github.io/static/images/teaser.mp4
https://arxiv.org/abs/2305.11846
ShortGPT
https://github.com/RayVentura/ShortGPT
Introduction to ShortGPT
ShortGPT is a powerful framework for automating content creation. It simplifies video creation, footage sourcing, voiceover synthesis, and editing tasks.
- Automated editing framework: Streamlines the video creation process with an LLM-oriented video editing language.
- Scripts and Prompts: Provides ready-to-use scripts and prompts for various LLM automated editing processes.
- Voiceover / Content Creation: Supports multiple languages including English, Spanish, Arabic, French, Polish, German, Italian, and Portuguese.
- Caption Generation: Automates the generation of video captions.
- Asset Sourcing: Sources images and video footage from the internet, connecting with the web and Pexels API as necessary.
- Memory and persistency: Ensures long-term persistency of automated editing variables with TinyDB.
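
To make the asset sourcing step a little more concrete, here is a minimal sketch of how a stock footage lookup against the Pexels video search endpoint could be prepared. This is not ShortGPT's own code: the function name `build_pexels_video_request`, the `per_page` default, and the placeholder API key are assumptions for illustration. The request is only built, not sent.

```python
import urllib.parse
import urllib.request

# Pexels video search endpoint; the API key goes in the Authorization header.
PEXELS_VIDEO_SEARCH = "https://api.pexels.com/videos/search"


def build_pexels_video_request(query, api_key, per_page=5):
    """Build (but do not send) a Pexels video search request.

    Hypothetical helper for illustration; not part of ShortGPT.
    """
    params = urllib.parse.urlencode({"query": query, "per_page": per_page})
    return urllib.request.Request(
        f"{PEXELS_VIDEO_SEARCH}?{params}",
        headers={"Authorization": api_key},
    )


# "YOUR_PEXELS_API_KEY" is a placeholder, not a real key.
req = build_pexels_video_request("ocean waves", "YOUR_PEXELS_API_KEY")
print(req.full_url)
```

Passing the request to `urllib.request.urlopen(req)` with a valid key would return JSON describing the matching videos. A pipeline like the one described above would presumably pick a file from that response and hand it to the editing step.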