Google announces Dreamix: a model that generates videos when given:
- video + prompt (Video editing)
- input images + prompt (Subject Driven Generation)
- input image + prompt (Image-toVideo
Tired: train a large language model (LLM) to generate text in human languages
Wired: train an LLM to generate proteins sequences, the language of life
In this recent paper, researchers trained a decoder-style LLM to generate functional proteins:
https://nature.com/articles/s41587-022-01618-2โฆ
on how to build a question-answering system on your custom website
Preprocess:
1. Website -> text (Scraping)
2. Text -> chunks -> embed
Inference:
1. Question -> embed
2. Get topK similar chunks
3. Answer generation using 1 & 2
https://platform.openai.com/docs/tutorials/web-qa-embeddingsโฆ
Today, we are opening up public access to a new AI product we have been building called Poe. Poe lets people ask questions, get instant answers, and have back-and-forth conversations with several AI-powered bots. (1/n)
is another great example of the ecosystem of AI startups leveraging Google Cloud's reliable and open infrastructure to build their businesses. https://goo.gle/3JIAWem
had a big impact on AI/ML. Their OpenAI Gym env for RL, their CLIP - a de facto model for any image-text research, their Whisper speech recognition is amazing - all open sourced.
In my view, they've had a bigger impact than many labs publishing 100s papers ๐งต
Data on the intellectual contribution to AI from various research organizations.
Some of organizations publish knowledge and open-source code for the entire world to use.
Others just consume it. twitter.com/johnjnay/statuโฆ
OpenAI just released a new model to distinguish between AI/human written text to protect against ChatGPT.
The classifier was trained on a pair of AI/human written dataset.
However.. I was easily able to trick it by using GPT3 to rewrite the text.
Demo: http://platform.openai.com/ai-text-classifierโฆ
Animatronics were first introduced by Disney in 1962 for the film Mary Poppins (released in 1964). Since then they have come a long way. This is a giant T-Rex presented at BBC in July 2018
[source: https://buff.ly/2A96pEf]
Can LLMs understand images? We introduce ๐ฅBLIP-2๐ฅ, a generic and efficient vision-language pre-training strategy that bootstraps from frozenโ๏ธimage encoders and frozenโ๏ธLLMs. BLIP-2 outperforms existing SoTAs with only 188M trainable parameters!
Github: https://github.com/salesforce/LAVIS/tree/main/projects/blip2โฆ
1/ Introducing high precision masked editing. No bleeding. Highly targeted AI based image editing to give you more control.
Here's an end-to-end workflow in 30 seconds:
Looped Transformers as Programmable Computers
Presents a framework for using transformer networks as universal computers by programming them with specific weights and placing them in a loop.
https://arxiv.org/abs/2301.13196
A group of Spam cans empowered by AI to tell their tales. The story is created by a neural network tuned on a specially altered piggy version of Brave New World. In this novel each character is born into a caste in the same way industrial farm animals are born into their fate.
One lesson from AI progress is itโs hard to predict the relative difficulties of various skills; one lesson from AI limitations is that humans are much more capable than we give ourselves credit for.
๐ฅ New (1h56m) video lecture: "Let's build GPT: from scratch, in code, spelled out."
https://youtube.com/watch?v=kCc8FmEb1nYโฆ
We build and train a Transformer following the "Attention Is All You Need" paper in the language modeling setting and end up with the core of nanoGPT.
Banning ChatGPT for scientific writing is misguided. It's like banning calculators or spell checkers
I find ChatGPT to be a very useful tool for generating a first version of a paragraph or refining an existing paragraph.
I predict this policy will be "refined" in <1 yr.
Can't wait to share our new Text-to-Audio model, AudioLDM. ๐
This video shows the generation result with a simple text prompt: "A music made by xxx".
More demos coming soon!๐
The paper will be available next Monday on arXiv! ๐
Our model will be open-sourced soon!๐
ChatGPT (and others) generate very fluent (but not always truthful) text.
Some worry that teachers, news-readers (like you!), and society in general will be swamped with AI-generated content.
That's why we built DetectGPT, a method for detecting if text comes from an LM.
new "Imagine" beta๐คฏ I used Text Prompts to generated all the 3D models on this coffee table scene... unlocking endless possibilities!
#LumaAI#UnrealEngine5#AI
demo for a GPT Talking Portrait ๐ค
It uses Whisper to let you ask GPT in your language, then One-Shot-Talking-Face will generate a talking portrait video from the answer ! Enjoy !
link: https://huggingface.co/spaces/fffiloni/gpt-talking-portraitโฆ