How to feed image as part of the prompt in the task? #1802
Unanswered
khushalkumar
asked this question in
Q&A
Replies: 1 comment
-
To feed an image directly to an agent as part of the prompt in a Task, you typically need to provide the image URL or a data representation of the image within the text prompt itself, utilizing specific syntax depending on the language model or platform you're using; most commonly, this involves embedding the image data as a special tag or placeholder that the system can interpret as an image input. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
We're using GPT4, which also supports image input. Then why does the agent not understand a base64 image when we provide it? How can we feed the image directly to the agent as part of the prompt in Task?
Beta Was this translation helpful? Give feedback.
All reactions