While often we’ve heard about artificial intelligence or AI, which is the idea of technology being used to mimic or surpass human behaviour, it relies on various subsets of technological advancement such as machine learning, neural networks, computer vision, natural language proceessing, and deep learning.
As of writing this, these areas are all advancing fairly rapidly. The intention of this article is to share some of the interesting projects you may want to experiment with, I’ll prioritize with free and open source examples where possible.
Google’s Tensorflow has several demos
Perhaps the most popular, Dalle-e-2 requires you sign up for their wait list
Uberduck.ai has a free plan with over 2,000 voices which includes API access if you’re a software developer
Koe.ai has a free demo with 8 voices where you can transform 20 seconds of your voice in near real-time, but it is not open source
Descript’s Overdub (formerly lyrebird.ai, requires an account)
Jukebox is for software developers
AIVA requires an account and the free tier is for non-commercial
Boomy requires an account
As I write this at a coffee shop, I decided to try the free image examples with a prompt of “people writing at a coffee shop” to experiment with the various levels of quality: