Pleased to share Spellburst: an LLM-powered creative coding tool, accepted at ACM UIST! Artists can move between semantic (high-level) and syntactic (low-level) ideas and explore many branches in parallel. Paper: More on the Replit ⠕/Stanford Human-Computer Interaction Group collab:

Tyler Angert

16,199 subscribers

54,056 views • 2 years ago • via X (Twitter)

Art Science & Technology

11 comments

Tyler Angert • 2 years ago

I worked with Miroslav Suzara, @jennyhansolo, @chris_pondoc, and @HariSubramonyam. Our research goal was to understand how creative coders explore conceptual spaces and work across different levels of abstraction.

Tyler Angert • 2 years ago

The UI is an auto-layout node-based canvas, with p5 sketches connected together by "operators", which can modify a sketch with a prompt, merge sketches together semantically, extract properties, and diff. The point is to visualize how sketches diverge and converge over time.
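The canvas described above can be pictured as a small graph: sketches are nodes and operators are edges that carry a kind. The following is a minimal sketch of that idea, not Spellburst's actual data model. The names `SketchNode`, `Operator`, `Canvas`, `divergence_points`, and `convergence_points` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Literal

# The four operator kinds named in the thread; the string labels are assumptions.
OperatorKind = Literal["prompt", "merge", "extract", "diff"]


@dataclass
class SketchNode:
    node_id: str
    code: str  # p5.js source held as plain text


@dataclass
class Operator:
    kind: OperatorKind
    inputs: list[str]  # ids of the sketches the operator reads
    output: str        # id of the sketch the operator produces
    prompt: str = ""   # only meaningful for prompt-style operators


@dataclass
class Canvas:
    nodes: dict[str, SketchNode] = field(default_factory=dict)
    operators: list[Operator] = field(default_factory=list)

    def add_sketch(self, node_id: str, code: str) -> None:
        self.nodes[node_id] = SketchNode(node_id, code)

    def connect(self, kind: OperatorKind, inputs: list[str],
                output: str, prompt: str = "") -> None:
        self.operators.append(Operator(kind, inputs, output, prompt))

    def divergence_points(self) -> list[str]:
        """Sketches that feed more than one operator (branches start here)."""
        uses: dict[str, int] = {}
        for op in self.operators:
            for source in op.inputs:
                uses[source] = uses.get(source, 0) + 1
        return sorted(node for node, count in uses.items() if count > 1)

    def convergence_points(self) -> list[str]:
        """Sketches produced from more than one input (branches meet here)."""
        return sorted(op.output for op in self.operators if len(op.inputs) > 1)


canvas = Canvas()
for name in ("base", "red", "fast", "combined"):
    canvas.add_sketch(name, "// p5 source placeholder")
canvas.connect("prompt", ["base"], "red", prompt="make it red")
canvas.connect("prompt", ["base"], "fast", prompt="speed it up")
canvas.connect("merge", ["red", "fast"], "combined")

print(canvas.divergence_points())   # sketches where exploration branches
print(canvas.convergence_points())  # sketches where branches are merged
```

Keeping operators as explicit edge records, rather than storing only parent pointers, is one way to let a layout routine draw both the branching and the merging.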

Tyler Angert • 2 years ago

The core of the interface works well independently of the LLM integration. It's a great canvas for branching p5 sketches and quickly testing out changes in parallel.

Tyler Angert • 2 years ago

But the real magic is here: a) any connected downstream sketches update every time you modify the prompt. b) divergent autocomplete helps artists push prompts in alternative directions; it uses the previous sketch as context, so it knows about the variables and concepts in play when guiding suggestions.
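Point (a) amounts to propagating a change through the graph. The following is a minimal sketch of that propagation, assuming each derived sketch has exactly one parent and that the node being edited is not the root. The function `update_prompt` and the `fake_generate` stub are hypothetical. The stub stands in for the LLM call, whose real interface the thread does not describe.

```python
from collections import deque
from typing import Callable


def update_prompt(
    parent: dict[str, str],        # child sketch id -> parent sketch id
    prompts: dict[str, str],       # sketch id -> prompt that produced it
    code: dict[str, str],          # sketch id -> current source text
    node: str,
    new_prompt: str,
    generate: Callable[[str, str], str],
) -> list[str]:
    """Change one sketch's prompt, then regenerate it and everything downstream."""
    prompts[node] = new_prompt

    # Invert the parent map so children can be looked up quickly.
    children: dict[str, list[str]] = {}
    for child, par in parent.items():
        children.setdefault(par, []).append(child)

    # Breadth-first order guarantees a parent is regenerated before its children.
    queue = deque([node])
    updated = []
    while queue:
        current = queue.popleft()
        code[current] = generate(code[parent[current]], prompts[current])
        updated.append(current)
        queue.extend(children.get(current, []))
    return updated


def fake_generate(parent_code: str, prompt: str) -> str:
    # Stand-in for a model call: just records what would have been asked.
    return f"{parent_code} + [{prompt}]"


parent = {"b": "a", "c": "b", "d": "b"}
prompts = {"b": "add circles", "c": "make them blue", "d": "make them bounce"}
code = {"a": "base", "b": "", "c": "", "d": ""}

updated = update_prompt(parent, prompts, code, "b", "add squares", fake_generate)
print(updated)
print(code["c"])
```

A graph that also contains merge operators would need a topological order rather than a plain breadth-first walk, since a merged sketch depends on more than one parent.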

Tyler Angert • 2 years ago

Secondly, c) global variables are extracted from sketches, linked to the parts of the prompt they relate to, then linked to sliders. d) "semantic merging" takes unrelated sketches and combines elements of them, like physics from one and a color palette from another.
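Point (c) can be illustrated with a rough sketch that pulls top-level numeric declarations out of p5.js source text and proposes a slider for each. This is an assumption about one way to do it, not the tool's method. The `Slider` class, the `extract_sliders` function, and the slider range heuristic are all hypothetical, and the step that links variables to parts of the prompt is not covered here.

```python
import re
from dataclasses import dataclass


@dataclass
class Slider:
    name: str
    value: float
    minimum: float
    maximum: float


# Matches unindented `let/var/const name = <number>;` lines only, so variables
# declared inside functions (which are indented in this example) are skipped.
GLOBAL_NUMBER = re.compile(
    r"^(?:let|var|const)[ \t]+([A-Za-z_$][\w$]*)[ \t]*=[ \t]*"
    r"(-?\d+(?:\.\d+)?)[ \t]*;?[ \t]*$",
    re.MULTILINE,
)


def extract_sliders(sketch_source: str) -> list[Slider]:
    """Propose one slider per top-level numeric variable in a sketch."""
    sliders = []
    for match in GLOBAL_NUMBER.finditer(sketch_source):
        value = float(match.group(2))
        # Assumed heuristic: let the slider range from value - |value| to
        # value + |value|, with a fallback span of 1 when the value is zero.
        span = abs(value) if value != 0 else 1.0
        sliders.append(Slider(match.group(1), value, value - span, value + span))
    return sliders


SKETCH = """let speed = 2.5;
let ballCount = 12;
let label = "hello";

function setup() {
  let local = 3;
  createCanvas(400, 400);
}
"""

sliders = extract_sliders(SKETCH)
for slider in sliders:
    print(slider)
```

A regular expression is only a rough stand-in for real parsing: it misses declarations with trailing comments, multiple variables per statement, and computed initial values.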

Tyler Angert • 2 years ago

There's still a lot to do, but it's a promising direction for more expressive UIs for making interactive media. We'll post a full demo vid next week and I'll be demoing this live in San Francisco late October. We aim to make this publicly available to use soon!

Tyler Angert • 2 years ago

This is the first (but not last) official research collab between @Replit and an academic institution. Hopefully this is some proof that you don't need a dedicated research team to contribute to the community! I did this in my free time / balanced it with my normal work.

Minn • 2 years ago

@ACMUIST @Replit @StanfordHCI This is reeaally neat. Would love to see this for audio too!

Tyler Angert • 2 years ago

@ACMUIST @Replit @StanfordHCI yesss and just text too

adam ho • 2 years ago

@ACMUIST @Replit @StanfordHCI damnnnn this is crazy

Tyler Angert • 2 years ago

@ACMUIST @Replit @StanfordHCI Appreciate it :))
