Multimodal artificial intelligence: evolution of technologies, architecture and new horizons of human-machine interaction
Бібліографічний опис статті:
Chernets Vadym. Multimodal artificial intelligence: evolution of technologies, architecture and new horizons of human-machine interaction//Наука онлайн: Міжнародний електронний науковий журнал - 2024. - №3. - https://nauka-online.com/publications/information-technology/2024/3/02-37/
Анотація: (English) Multimodal artificial intelligence (AI) is an advanced field that combines different types of data, such as text, images, audio, and video. The article discusses the evolution of multimodal AI, its architectural features, popular modern models (CLIP, DALL-E, GPT-4, Flamingo, and others), and the prospects for human-machine interaction. Particular attention is paid to the transformation of approaches to data processing, the integration of different modalities, and the creation of more natural interaction interfaces.