Carnegie Mellon University
ACM CHI 2024

Jigsaw: Supporting Designers to Prototype Multimodal Applications by Chaining AI Foundation Models

Jigsaw interface
Jigsaw is a prototype system that lets designers generate and alter creative content with AI foundation models represented as puzzle pieces. Designers can search for foundation model capabilities via the Catalog Panel (a) and combine capabilities across different modalities by assembling compatible pieces on the Assembly Panel (b). Designers can specify inputs and observe intermediate results via the Input and Output Panels (c). Designers can also ask the Assembly Assistant (d) to recommend a chain of foundation models that accomplishes a specified task.

Summary

Recent advances in AI foundation models have made it possible to use them off the shelf for creative tasks, such as ideating design concepts or generating visual prototypes. However, integrating these models into the creative process can be challenging because they often exist as standalone applications tailored to specific tasks. To address this challenge, we introduce Jigsaw, a prototype system that uses puzzle pieces as a metaphor for foundation models. Jigsaw lets designers combine foundation model capabilities across modalities by assembling compatible puzzle pieces. To inform the design of Jigsaw, we interviewed ten designers and distilled design goals. In a user study, we showed that Jigsaw enhanced designers' understanding of available foundation model capabilities, provided guidance on combining capabilities across different modalities and tasks, and served as a canvas to support design exploration, prototyping, and documentation.
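The idea of compatible pieces can be illustrated with a minimal sketch. This is not Jigsaw's implementation. It assumes each piece has a single input modality and a single output modality, and that two pieces fit when one's output matches the next one's input. The names `Piece`, `is_valid_chain`, `recommend_chain`, and the three catalog entries are hypothetical, and the breadth-first search only stands in for the kind of chain recommendation the Assembly Assistant provides.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Piece:
    """Hypothetical puzzle piece: one model capability with typed ends."""
    name: str
    input_modality: str
    output_modality: str


def is_valid_chain(chain):
    """Adjacent pieces fit only if the output modality matches the next input."""
    return all(a.output_modality == b.input_modality
               for a, b in zip(chain, chain[1:]))


def recommend_chain(catalog, source, target):
    """Breadth-first search for a shortest chain from source to target modality.

    Returns a list of pieces, or None if no chain exists in the catalog.
    """
    queue = deque([(source, [])])
    seen = {source}
    while queue:
        modality, chain = queue.popleft()
        if modality == target and chain:
            return chain
        for piece in catalog:
            if (piece.input_modality == modality
                    and piece.output_modality not in seen):
                seen.add(piece.output_modality)
                queue.append((piece.output_modality, chain + [piece]))
    return None


# Illustrative catalog; the entries are made up for this sketch.
CATALOG = [
    Piece("describe-image", "image", "text"),
    Piece("text-to-image", "text", "image"),
    Piece("text-to-audio", "text", "audio"),
]

chain = recommend_chain(CATALOG, "image", "audio")
print([piece.name for piece in chain])
```

Under these assumptions, an image-to-audio request resolves to an image description piece followed by a text-to-audio piece, while a request with no matching pieces returns `None`.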

Video


Example Results

Interior Design Exploration

Interior design workflows

Audio-Visual Storytelling

Audio-visual story

Music Creation Pipeline

Music creation