Thinking in Images

Images become a medium for thinking through soft navigation in latent space.

Thinking in Images - Key Visual

Overview

Thinking in Images explores how generative AI can function as a medium for reasoning rather than image production. The system allows users to navigate semantic space visually, enabling meaning to emerge gradually through interaction.

Role

Concept Development | Interaction Design | Computational Design

Team

Yuhan Wang, Joe Tu

Institution / Year

Harvard University - Graduate School of Design 2025

Tools

Python | Stable Diffusion | Latent Space Models | Generative AI | Custom Interface Prototyping

Thinking in Images - 1
Thinking in Images - 2

Background

Contemporary generative models primarily operate through prompt-response structures that position images as outputs of textual instruction. This approach limits the role of images to representation rather than cognition. The project questions whether latent space can instead operate as an exploratory medium where ideas evolve through iterative interaction.

Concept

The project introduces the idea of soft vector navigation within latent space. Instead of specifying precise visual targets, users provide approximate directional signals through sketching, image placement, and interaction. Meaning develops across trajectories rather than endpoints, allowing ambiguity and emergence to remain central to the process.

The Project

The interface enables users to interact with generative models through visual input rather than solely textual prompts. Drawings and image fragments are interpreted as approximate semantic directions, allowing the system to iteratively adjust visual trajectories while maintaining openness. Images become both the medium and the site of reasoning.

Media Carousel

Thinking in Images - 2

1 / 10

Thinking in Images - 3

2 / 10

Thinking in Images - 4

3 / 10

Thinking in Images - 5

4 / 10

Thinking in Images - 6

5 / 10

Thinking in Images - 7

6 / 10

Thinking in Images - 8

7 / 10

Thinking in Images - 9

8 / 10

Thinking in Images - 10

9 / 10

Thinking in Images - 11

10 / 10

Slide 1 of 10

Process

The system experiments with iterative interaction loops between user intuition and model inference. Visual inputs act as soft constraints guiding latent transitions. Through continuous feedback, semantic neighborhoods are explored rather than selected, allowing ideas to evolve dynamically.

Reflection / Impact

Thinking in Images suggests a broader paradigm shift in how generative AI may support creative thinking. By framing latent space as a cognitive environment, the project proposes new forms of human-AI collaboration where ambiguity and exploration become productive components of reasoning.