Multimodal artificial intelligence: evolution of technologies, architecture and new horizons of human-machine interaction
Bibliographic description of the article for citation:
Chernets Vadym. Multimodal artificial intelligence: evolution of technologies, architecture and new horizons of human-machine interaction // Science online: International Scientific e-zine - 2024. - №3. - https://nauka-online.com/en/publications/information-technology/2024/3/02-37/
Annotation: Multimodal artificial intelligence (AI) is a field concerned with systems that process and combine different types of data, such as text, images, audio, and video. The article discusses the evolution of multimodal AI, its architectural features, widely used modern models (CLIP, DALL-E, GPT-4, Flamingo, and others), and the prospects for human-machine interaction. Particular attention is paid to changes in approaches to data processing, the integration of different modalities, and the creation of more natural interaction interfaces.