Articles
| Open Access |
https://doi.org/10.37547/ajast/Volume06Issue02-10
The Rise and Impact of Modern Generative AI Tools: A Comparative Study of Chatgpt, Gemini, Claude
Abstract
The present research utilizes three popular multimodal AI systems (Gemini, Cloud AI, ChatGPT) to evaluate their ability to interpret visual language by analyzing responses to three different images. In addition to evaluating the systems' ability to produce accurate and detailed descriptions of each image, this research evaluated their ability to contextualize each description within an appropriate framework of understanding, as well as assess their response times. Results indicate that ChatGPT provided the most accurate and descriptive descriptions of the images analyzed in this study, particularly those which depicted emotionally and/or socially nuanced scenes; that Gemini performed reasonably well in terms of conceptual interpretation, though was inconsistent in its provision of specific details regarding the image(s); and that while Cloud AI responded more quickly than either ChatGPT or Gemini, it failed to provide as much detail or relevance to the situation presented in the images. These findings emphasize the need to develop multimodal AI systems that balance speed, emotional intelligence, and semantic accuracy to be used in the real world when reasoning with images.
Keywords
Artificial Intelligence, Gemini, Natural Language processing
References
Google DeepMind. (2024). Gemini 1.5 Models: Technical Capabilities and Roadmap. https://deepmind.google/technologies/gemini
Anthropic. (2024). Claude and Constitutional AI: A Safer Path for LLMs. https://www.anthropic.com
Kasneci, E., Sessler, K., & Betsch, T. (2023). ChatGPT for Good? On Opportunities and Challenges of LLMs in Education. arXiv. https://arxiv.org/abs/2302.05756
McKinsey & Company. (2023). The Economic Potential of Generative AI: The Next Productivity Frontier. https://www.mckinsey.com.
Ji, Z., Lee, N., Frieske, R., et al. (2023). Survey of Hallucination in Natural Language Generation. arXiv. https://arxiv.org/abs/2302.03636
OpenAI. (2024). GPT-4 Technical Report. https://openai.com/research/gpt-4
Ziegler, D., Brockman, G., & Christiano, P. (2023). GitHub Copilot and AI in Software Development. GitHub Docs. https://docs.github.com/en/copilot
Mistral AI. (2024). Open-Weight Language Models from Europe. https://mistral.ai
OECD. (2023). Generative AI and Intellectual Property Rights. https://www.oecd.org/digital
Article Statistics
Copyright License
Copyright (c) 2026 Suhad Ateyah, Zainab Khairallah Kadhim

This work is licensed under a Creative Commons Attribution 4.0 International License.