Are you looking for a way to test ChatGPT? Try and get it to generate something that has procedural tasks, such “create an image of a step by step guide to make a paper airplane” or “create an image of a step by step guide to make an origami rabbit“.
This week I was asked to speak to a high school class about the challenges of artificial intelligence. As I prepared for the session I had the opportunity to chat with the teacher for the class. I talked about my testing with ChatGPT and my latest blog post about artificially generated images.
ChatGPT is really bad at some things
I made the comment, it’s really bad at drawing hands, especially when asking it to create someone with just four fingers (don’t ask, it was a Dungeons & Dragons request). I have included some of the testing I’ve done (on more than one occasion, with hand tests). The results are always similar.
Me: Create an image of a dwarf with four fingers.
ChatGPT: Here’s the portrait of the dwarf character with four fingers. Let me know what you think!
Me: Show me his hands! He should only have four fingers on his right hand. He lost one of his fingers in an accident.
ChatGPT: Here is a detailed close-up of the dwarf character’s right hand, showing that he has only four fingers.
Me: Make it three fingers…
ChatGPT: Here’s the updated image showing the dwarf character’s right hand with only three fingers.
I guess it worked? I still think this dwarf hand has four fingers, you can see the knuckle of the fourth finger.
ChatGPT is also really bad at procedural tasks/instructions
In our discussion, the teacher pointed out that ChatGPT also struggles with procedurally generated instructions that utilize images. He cited an example involving origami rabbit instructions.
Like always, I did some testing.
Me: Can you create an image of instructions for creating a paper airplane?
ChatGPT: Here is the step-by-step instructional illustration for creating a paper airplane. Each panel shows a different step in the process, designed to be easy to follow. You can view and use the image to make your own paper airplane.
Me: Now origami instructions for making a rabbit.
ChatGPT: Here is the step-by-step instructional illustration for creating an origami rabbit. Each panel illustrates a different folding step, making it easy to follow. You can view and use the image to create your own origami rabbit.
Me: Now origami instructions for making the titanic sinking.
ChatGPT: Here is the step-by-step instructional illustration for creating an origami version of the Titanic sinking. Each panel illustrates a different stage of folding, designed to visually depict the ship’s sinking. You can view and use the image to create your own origami Titanic.
Okay, so the last example is a bit cheeky, but I wanted to test the absurdity of this prompt. ChatGPT consistently struggles with images focusing on procedural tasks.
Can you spot the errors?
The errors are numerous. One doesn’t have to look beyond the simple numbering on the instructional images to see where ChatGPT struggles. This is consistent between the three tests I ran above. This isn’t new, consider a prompt I gave ChatGPT months ago when I asked it to create a mind map. It just doesn’t handle numbers and words very well.
With the paper airplane, you have to look closely. However, when you do, you can spot tons of errors. Once again it struggles with numbers.
In the image above with instructions for the origami rabbit, I lose the concept between steps 2 and 3. That’s a pretty big jump from a triangle shaped paper to a fully developed rabbit. The realism is uncanny.
With the Titanic, I think the errors are pretty glaring…
Wrapping up
I am continually amazed by ChatGPT’s capabilities. The technology not only impresses me but it also presents challenges and solutions for the information profession and society at large. Procedural tasks remain a serious issue for generative AI, and the above images serve as excellent examples of its limitations. However, as these models evolve, I am confident they will get better. Just consider the remarkable advancements in generative AI video over the past year.
Video generative AI is something I have not yet tested and I will soon publish a post about my experiences testing generative AI audio.
Header Photo by Calle Macarone on Unsplash