Articles | Open Access | https://doi.org/10.37547/ajast/Volume06Issue02-10

The Rise and Impact of Modern Generative AI Tools: A Comparative Study of Chatgpt, Gemini, Claude

Suhad Ateyah , University of Kufa, Iraq
Zainab Khairallah Kadhim , General Directorate of Education in Najaf, Iraq

Abstract

The present research utilizes three popular multimodal AI systems (Gemini, Cloud AI, ChatGPT) to evaluate their ability to interpret visual language by analyzing responses to three different images. In addition to evaluating the systems' ability to produce accurate and detailed descriptions of each image, this research evaluated their ability to contextualize each description within an appropriate framework of understanding, as well as assess their response times. Results indicate that ChatGPT provided the most accurate and descriptive descriptions of the images analyzed in this study, particularly those which depicted emotionally and/or socially nuanced scenes; that Gemini performed reasonably well in terms of conceptual interpretation, though was inconsistent in its provision of specific details regarding the image(s); and that while Cloud AI responded more quickly than either ChatGPT or Gemini, it failed to provide as much detail or relevance to the situation presented in the images. These findings emphasize the need to develop multimodal AI systems that balance speed, emotional intelligence, and semantic accuracy to be used in the real world when reasoning with images.

Keywords

Artificial Intelligence, Gemini, Natural Language processing

References

Google DeepMind. (2024). Gemini 1.5 Models: Technical Capabilities and Roadmap. https://deepmind.google/technologies/gemini

Anthropic. (2024). Claude and Constitutional AI: A Safer Path for LLMs. https://www.anthropic.com

Kasneci, E., Sessler, K., & Betsch, T. (2023). ChatGPT for Good? On Opportunities and Challenges of LLMs in Education. arXiv. https://arxiv.org/abs/2302.05756

McKinsey & Company. (2023). The Economic Potential of Generative AI: The Next Productivity Frontier. https://www.mckinsey.com.

Ji, Z., Lee, N., Frieske, R., et al. (2023). Survey of Hallucination in Natural Language Generation. arXiv. https://arxiv.org/abs/2302.03636

OpenAI. (2024). GPT-4 Technical Report. https://openai.com/research/gpt-4

Ziegler, D., Brockman, G., & Christiano, P. (2023). GitHub Copilot and AI in Software Development. GitHub Docs. https://docs.github.com/en/copilot

Mistral AI. (2024). Open-Weight Language Models from Europe. https://mistral.ai

OECD. (2023). Generative AI and Intellectual Property Rights. https://www.oecd.org/digital

Article Statistics

Copyright License

Download Citations

How to Cite

Suhad Ateyah, & Zainab Khairallah Kadhim. (2026). The Rise and Impact of Modern Generative AI Tools: A Comparative Study of Chatgpt, Gemini, Claude. American Journal of Applied Science and Technology, 6(02), 97–106. https://doi.org/10.37547/ajast/Volume06Issue02-10