- The Meta
- Posts
- ▲ AI Learns How to Control Your Computer
▲ AI Learns How to Control Your Computer
AND: Runway AI reinvents character animation
Welcome future tech & AI enthusiasts — this week’s highlights:
Anthropic AI showcased how their model Claude can use a computer.
Midjourney has introduced a new AI image editor and retexture feature.
We’ll also look more into using AI image upscalers.
Let’s get into it.
In today’s issue:
Learn: How to upscale images with AI
Learn: How to create 3D text with AI
▲ AI TECH
Claude AI can use a computer to build apps
Anthropic AI showcased how their model Claude can use a computer. The skills are a feature called "computer use." It was highlighted with the release of Claude 3.5 Sonnet in October 2024. You can see it in action here.
What you need to know
What it does: Claude can now interact with computers much like a human would. This includes looking at the screen, moving the cursor, clicking buttons, and typing. This lets Claude navigate software, browse the web, and do complex, multi-step tasks.
Why it’s useful: This feature automates tasks that typically need humans. It boosts productivity and automates complex or repetitive desktop workflows. Developers can use this feature to automate tests, research, and multi-step tasks in software. They can also integrate it into their apps with the API.
How to use it: The "computer use" feature is in public beta. It's tied to the Claude 3.5 Sonnet model. Anthropic has made this accessible for developers through APIs.
Key Ideas
▲ Real-life use cases: Videos are emerging of Claude in action. They include filling out forms, using apps, and making simple programs in real time. Each use case comes with a mix of awe and concern. People are amazed by the technology's potential. But, its current limits show in tasks that need many steps or specific interactions. It’s early days — but it’s an exciting start.
▲ AI IMAGES
A Flying Whale in the Sky edited with Midjourney Editor
Midjourney has introduced a new AI image editor and retexture feature. This is yet another step forward for AI image creation and editing.
What you need to know
How it works: The editor lets users upload any web or personal image. They can then edit it within Midjourney's platform. This includes not just retexturing but also resizing, inpainting, and outpainting.
Retexturing: Users can now change the textures of images while keeping their shapes. This uses depth controlnet tech. It allows changes in surface details, lighting, and material based on new text prompts.
How to use it: Midjourney's CEO, David Holz, said only a small group of the community will initially have access. This cautious rollout aims to enhance both human and AI moderation to prevent misuse.
Public Reaction: There’s a lot of excitement about these updates as it opens up more ways to get creative in your projects. Although, there are concerns about ethics. This is especially true for deepfakes.
Key Idea
▲ AI in Art: Midjourney's latest features show that AI is more than a tool for creating new images. It can now make significant changes to existing ones. This blends user creativity with ai precision. Expect a lot more creative AI images appearing across the web.
▲ AI VIDEO
Runway AI can create character animations from recorded video
Runway has introduced Act-One, a new feature in its Gen-3 Alpha model. It aims to simplify character animation. You can see some incredible examples here.
What you need to know
What does it do: This tool lets creators make expressive character performances. They need only a single video and a character image. It replaces traditional motion capture and complex rigging setups.
How it works: Act-One captures facial expressions from a video input, like a smartphone recording. It applies these to AI-generated characters. It includes transferring subtle details of an actor's performance. This enables creating realistic animations from video and voice recordings.
How to use it: Access to Act-One has started rolling out to users. It will be available to everyone soon. This rollout includes users with free accounts but with limits on video-generating tokens.
Key Idea
▲ Motion capture reimagined: This tool is a big leap for video creators, animators, and filmmakers. It enables them to create more lifelike AI characters. They can do this without needing expensive or complex equipment. It's designed for different camera angles and focal lengths. It’s now possible with a simple phone camera to create Disney Pixar-esque animations — wild times.
▲ EVEN MORE NEWS
Other big ideas from the past week.
Claude lastest model benchmarks
Claude Sonnet Benchmarks: Showing areas where they outperform ChatGPT
ChatGPT Advanced Voice: All Plus users in the EU, Switzerland, Iceland, Norway, and Liechtenstein now have access to Advanced Voice
Perplexity just introduced Reasoning mode so you can ask multi-layered questions and Perplexity will adopt.
Mochi 1: A new AI video generator has been released AND it’s Open-source.
The Lab
▲ AI EDUCATION
Upscaling images
Magnific AI Upscale & Enhancer
Why Upscale Images?
Improve Quality: It makes your images clearer, sharper, and more detailed. Useful for both personal enjoyment and professional applications like marketing, art, or digital displays.
Better for Printing: Higher resolution images give better quality when printed. So you don’t get pixelation that can happen with low-resolution images.
Digital Use: E-commerce or digital art, upscaled images can make products or artworks stand out on websites or socials.
Restoration: You can take old or damaged photos and bring them a new life — from old family memories to restoring historical documents.
3 Platforms I’ve Tested:
Results:
If you want a free option you can run Upscayl locally on your computer. If you’ve already got a Canva subscription it’s easy to just run it in app. If you want more advanced features such as image enhancing then Magnific AI is the go-to paid option.
🔬 Interested in learning about AI for business? Let us know what you want to learn & join The Lab waitlist.
▲ AI IMAGE SHOWCASE
Create this 3D Render typography with Ideogram
Create More: 3d render typography
Ideogram Prompt: A 3d render photo of a vibrant illustration of the word "Create More" in intricate gold typography. The letters are adorned with ornate flowers, leaves, and butterflies. The background is dark. The overall image has a shimmering, iridescent quality, reminiscent of opal or crystal.
Image Usage Suggestion: Social media, posters, print products
💌 Reply to this email with your AI image generation or suggestion of what you’d like to see and I'll feature it in a future newsletter.
▲ SHARE YOUR THOUGHTS
Help shape the content you see here by giving feedbackYou can add more feedback after choosing an option |
Have specific feedback or want to get in touch? Reply to this email and I’ll get back to you.
Know someone who’d love this newsletter? Forward it to a friend and have them sign up here.
Thanks for reading — until next time.
Stay curious,
Matt Lok, Editor
Brought to you by metalabs. Helping boost your brand’s visibility and attract more leads through branding & digital marketing.