Mozualization: Crafting Music and Visual Representation with Multimodal AI

Wanfang Xu, Lixiang Zhao, Haiwen Song, Xinheng Song, Zhaolin Lu, Yu Liu, Min Chen, Eng Gee Lim, Lingyun Yu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

In this work, we introduce Mozualization, a music generation and editing tool that creates multi-style embedded music by integrating diverse inputs, such as keywords, images, and sound clips (e.g., segments from various pieces of music or even a playful cat’s meow). Our work is inspired by the ways people express their emotions—writing mood-descriptive poems or articles, creating drawings with warm or cool tones, or listening to sad or uplifting music. Building on this concept, we developed a tool that transforms these emotional expressions into a cohesive and expressive song, allowing users to seamlessly incorporate their unique preferences and inspirations. To evaluate the tool and, more importantly, gather insights for its improvement, we conducted a user study involving nine music enthusiasts. The study assessed user experience, engagement, and the impact of interacting with and listening to the generated music.

Original language: English
Title of host publication: CHI EA 2025 - Extended Abstracts of the 2025 CHI Conference on Human Factors in Computing Systems
Publisher: Association for Computing Machinery
ISBN (electronic): 9798400713958
DOI
Publication status: Published - 26 Apr 2025
Externally published
Event: 2025 CHI Conference on Human Factors in Computing Systems, CHI EA 2025 - Yokohama, Japan
Duration: 26 Apr 2025 to 1 May 2025

Publication series

Name: Conference on Human Factors in Computing Systems - Proceedings

Conference

Conference: 2025 CHI Conference on Human Factors in Computing Systems, CHI EA 2025
Country/Territory: Japan
City: Yokohama
Period: 26/04/25 to 1/05/25
