In tackling the New Year's articles, I opted to cover similar celebrations across numerous countries, not just the festivities on January 1st. This expanded the scope of the project substantially. To establish a baseline for comparison as the year progresses, I tasked the AI tools with generating most of the content autonomously, making few modifications beyond what is documented. This allowed me to objectively evaluate the AIs' independent abilities regarding content creation.
AI Trials: January Pt 1
In the first post AI Trials: January Part 1, I began an experiment with five AI models—Bard, Claude, Copilot, ChatGPT-4, and LLaMA—to see how effectively they could identify the top 10 most populous countries and detail their key events. Initially, Bard furnished the most thorough information. However, upon closer inspection, I found the reliability of its data lacking. Refining my questions for the AIs illuminated their capabilities and limitations, ultimately leading me to set aside Claude and Copilot for the remainder of the project due to their under-performance.
AI Trials: January Pt 2
In AI Trials: January Part 2, I confirmed that ChatGPT-3.5 worked effectively with large datasets, meriting the reservation of ChatGPT-4 for more complex tasks, while Claude was the most user-friendly tool for content creation. However, compiling detailed event lists challenged ChatGPT-4 and LLaMA, spotlighting the persistent need for human verification of AI-generated content accuracy.
AI Blog Posts
As an eternal tinkerer, my curiosity, passion, and sheer stubbornness fuel a relentless desire to experiment, learn, and share knowledge, which keeps my creative spirit ignited. I'm constantly looking for new areas to explore, driven by imagination to see where new and evolving technologies might take me.
Driven by passion, not profit, though a coffee is always welcome.
Disclaimer: The views and opinions expressed in this article are solely those of the author and do not reflect the official policy or position of Amazon Web Services (AWS). The author is a UX designer at Amazon Web Services (AWS) and has no involvement in, nor does their work pertain to, any collaborative agreements that AWS may have with Anthropic, the creators of Claude. The insights and analyses presented here are entirely independent and unrelated to any projects or initiatives between AWS and Anthropic. All content in this post is based on publicly available interfaces and is not influenced by the author's employer.