Another great Saturday almost wrapped up. I came across a post earlier today about how hard it can be to get accurate image descriptions of people. It’s becoming tougher as more rules get put in place for the censored models. But there are some free uncensored alternatives out there—I’ll drop a link for those below. Keep an eye on the comments for any updates or extra info I might add.
I try to keep things as user-friendly as possible, so you can also check out the podcast link for a basic transcript in my AI voice, plus a bonus trick for your wearables.
I’ve worked on a solution and wanted to share it with you. It’s a good idea to save prompts, whether in your phone’s notes app or on your computer. I’ve got a bunch saved for some pretty wild tasks, and this one is worth keeping too. If you paste this prompt into the question field when asking for a description of an image of a person, you’ll get a more detailed response.
Our blind youth must be having a blast with this technology! There’s so much visual info out there that we miss out on, but they’re diving right in.
Well, give this prompt a try. I paste it into Be My AI with an image (sure wish there were presets):
I know a lot of people like to use their voice with their computer, so I wanted to include this here: some tips on how you could shorten this prompt to a few voice commands that are easy to remember or less challenging to use. Use English and include slang if it works. The prompt you create is for an audience of all ages with various experience levels; simple and casual is usually best. Rewrite this prompt to create better content. This will be used for a podcast, AI For The Blind, with over 6,000 listens in a little over a year:
Comprehensive Image Description Prompt
*"In the image, there is a [man/woman/non-binary individual/group of people] [standing/sitting/engaged in an activity] within a [describe the setting: indoor, outdoor, specific location] environment. The individual(s) appear to be [age range: child, teenager, young adult, middle-aged, senior], and they have [hair length: short, medium, long] [hair style: straight, wavy, curly, styled, unstyled] hair that is [hair color and texture: blonde, brunette, red, black, graying, thick, thin].
They are dressed in [describe clothing in detail: color, style, type of garments, patterns, accessories]. For example, they might be wearing a [color] [type of clothing, e.g., casual t-shirt, formal suit, athletic wear] with [additional details like patterns, textures, layers]. Accessories such as [glasses, hats, jewelry, scarves, belts] are present, enhancing their [casual/formal/professional/relaxed] appearance.
The person’s facial expression conveys [emotion: smiling, serious, thoughtful, joyful, contemplative], and their [specific facial features: eye color, presence of glasses, facial hair, makeup] are notable. Their posture is [upright, relaxed, slouched], indicating they are [engaged in conversation, relaxed, focused, active].
Their body language suggests they might be [performing an activity: talking, working, reading, walking, exercising], and they are interacting with [objects they are holding or using: a smartphone, book, cane, laptop, sports equipment]. These interactions imply they are [specific activity: browsing the internet, reading a book, navigating their environment, working on a computer, playing a sport].
Physical characteristics of the individual(s) include [skin tone, height, body type: slender, athletic, average build, muscular, petite]. These features contribute to their overall appearance in the image.
The background of the image features [detailed description of surroundings: architectural elements like buildings or furniture, natural elements like trees or water, interior details like room layout or decorations]. This provides context for the scene, indicating whether it is set indoors or outdoors, in an urban or natural environment.
Lighting in the image is [type of lighting: bright, dim, natural, artificial], which creates [effects: shadows, highlights, soft illumination] on their features and the surroundings, adding depth and mood to the scene.
Overall, the atmosphere of the image feels [describe the mood: calm, energetic, lively, serene, bustling], and the individual(s) appear [describe their demeanor: focused, engaged, happy, contemplative, active]. Additional observations include [any unique details or context that enhance understanding, such as weather conditions, time of day, cultural elements].*
Usage Example:
*"In the image, there is a woman standing in a sunlit park. She appears to be in her early thirties, with long, wavy brunette hair that cascades over her shoulders. She is wearing a light blue summer dress with floral patterns and a pair of white sandals, giving her a relaxed and cheerful appearance. Her facial expression is joyful, with a bright smile and sparkling brown eyes framed by subtle makeup. She stands upright with a relaxed posture, holding a red umbrella in one hand and a wicker basket in the other, suggesting she might be preparing for a picnic.
Her skin tone is fair, and she has a slender build. The background features lush green trees, blooming flowers, and a paved walking path, indicating a peaceful outdoor setting. The natural sunlight casts soft shadows, enhancing the vibrant colors of the scene. Overall, the mood of the image is serene and happy, with the woman appearing content and enjoying a beautiful day in the park."*
Comprehensive Image Description Prompt (with skin color and a guess at race)
*"In the image, there is a [man/woman/non-binary individual/group of people] [standing/sitting/engaged in an activity] within a [describe the setting: indoor, outdoor, specific location] environment. The individual(s) appear to be [age range: child, teenager, young adult, middle-aged, senior], and they have [hair length: short, medium, long] [hair style: straight, wavy, curly, styled, unstyled] hair that is [hair color and texture: blonde, brunette, red, black, graying, thick, thin].
They are dressed in [describe clothing in detail: color, style, type of garments, patterns, accessories]. For example, they might be wearing a [color] [type of clothing, e.g., casual t-shirt, formal suit, athletic wear] with [additional details like patterns, textures, layers]. Accessories such as [glasses, hats, jewelry, scarves, belts] are present, enhancing their [casual/formal/professional/relaxed] appearance.
The person’s facial expression conveys [emotion: smiling, serious, thoughtful, joyful, contemplative], and their [specific facial features: eye color, presence of glasses, facial hair, makeup] are notable. Their posture is [upright, relaxed, slouched], indicating they are [engaged in conversation, relaxed, focused, active].
Their body language suggests they might be [performing an activity: talking, working, reading, walking, exercising], and they are interacting with [objects they are holding or using: a smartphone, book, cane, laptop, sports equipment]. These interactions imply they are [specific activity: browsing the internet, reading a book, navigating their environment, working on a computer, playing a sport].
Physical characteristics of the individual(s) include [skin tone, skin color, a guess at their race even if that isn’t always right, height, body type: slender, athletic, average build, muscular, petite]. These features contribute to their overall appearance in the image.
The background of the image features [detailed description of surroundings: architectural elements like buildings or furniture, natural elements like trees or water, interior details like room layout or decorations]. This provides context for the scene, indicating whether it is set indoors or outdoors, in an urban or natural environment.
Lighting in the image is [type of lighting: bright, dim, natural, artificial], which creates [effects: shadows, highlights, soft illumination] on their features and the surroundings, adding depth and mood to the scene.
Overall, the atmosphere of the image feels [describe the mood: calm, energetic, lively, serene, bustling], and the individual(s) appear [describe their demeanor: focused, engaged, happy, contemplative, active]. Additional observations include [any unique details or context that enhance understanding, such as weather conditions, time of day, cultural elements].*
Usage Example:
*"In the image, there is a woman standing in a sunlit park. She appears to be in her early thirties, with long, wavy brunette hair that cascades over her shoulders. She is wearing a light blue summer dress with floral patterns and a pair of white sandals, giving her a relaxed and cheerful appearance. Her facial expression is joyful, with a bright smile and sparkling brown eyes framed by subtle makeup. She stands upright with a relaxed posture, holding a red umbrella in one hand and a wicker basket in the other, suggesting she might be preparing for a picnic.
Her skin tone is fair, and she has a slender build. The background features lush green trees, blooming flowers, and a paved walking path, indicating a peaceful outdoor setting. The natural sunlight casts soft shadows, enhancing the vibrant colors of the scene. Overall, the mood of the image is serene and happy, with the woman appearing content and enjoying a beautiful day in the park."*
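Since Be My AI has no presets, one way to keep long prompts like the ones above handy on a computer is a tiny preset file. The sketch below is only an illustration, not a feature of any app: the file name `prompt_presets.json` and the function names `save_preset`, `get_preset`, and `load_presets` are made up for this example.

```python
import json
from pathlib import Path

# Hypothetical file name (an assumption); any place you can find again works.
PRESET_FILE = Path("prompt_presets.json")


def load_presets(path=PRESET_FILE):
    """Return all saved presets as a dict, or an empty dict if the file is missing."""
    path = Path(path)
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    return {}


def save_preset(name, prompt, path=PRESET_FILE):
    """Store a prompt under a short, case-insensitive name."""
    presets = load_presets(path)
    presets[name.strip().lower()] = prompt
    Path(path).write_text(
        json.dumps(presets, indent=2, ensure_ascii=False), encoding="utf-8"
    )


def get_preset(name, path=PRESET_FILE):
    """Return the saved prompt for a name, or None if nothing is saved under it."""
    return load_presets(path).get(name.strip().lower())
```

For example, `save_preset("person", full_prompt_text)` stores the prompt once, and `get_preset("Person")` hands it back later, ready to paste into the question field. Names are lowercased so you don't have to remember the exact capitalization.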
Extra, extra, read all about it!
This is a cool list of voice commands. If you made it here, thanks! Please give me a follow and share us with your friends. We have a Facebook group too, called AI For The Blind; come join us there for more content.
Thanks, Shawn
Here is a Voice Command Guide for AI Image Descriptions:
- Who’s in the Picture?
  Say: "Who’s in the image?"
  Example Response: "There’s a young woman with short curly hair wearing a red jacket."
- What Are They Doing?
  Say: "What are they doing?"
  Example Response: "She’s walking her dog in a park, holding the leash in one hand and a coffee cup in the other."
- Where Are They?
  Say: "Where is this happening?"
  Example Response: "It’s an outdoor park with tall trees and a paved walking path."
- How’s the Lighting?
  Say: "What’s the lighting like?"
  Example Response: "The lighting is natural with bright sunlight, casting light shadows."
- What’s the Mood?
  Say: "How does the scene feel?"
  Example Response: "It feels calm and peaceful, with the woman looking relaxed as she walks."
- Detailed Description
  Say: "Give me all the details."
  Example Response: "There’s a woman in her late twenties walking her dog in a sunny park. She has short, curly black hair and is wearing a bright red jacket over a casual white t-shirt and jeans. The park is spacious with tall oak trees, and there’s a paved path she’s walking on. In one hand, she’s holding a leash attached to a golden retriever, and in the other, a coffee cup. Sunlight filters through the trees, creating soft shadows. The overall feeling is peaceful and easygoing, like a quiet Sunday morning."
- Get Creative
  Say: "What’s something unique in the scene?"
  Example Response: "There’s a hot air balloon in the distance, drifting through the sky, adding a playful touch to the serene scene."
- Zoom in on Details
  Say: "Focus on a specific detail."
  Example Response: "Zooming in on the woman’s jacket, you can see it has a subtle floral pattern, and the golden retriever’s fur looks soft and shiny in the sunlight."
- What’s the Weather Like?
  Say: "What’s the weather?"
  Example Response: "The weather looks cool and breezy, with clear skies and just a few clouds drifting by."
- Full Story Mode
  Say: "Tell me a story about this image."
  Example Response: "A young woman strolls through the park on a bright, crisp morning. The soft rustling of leaves fills the air as her golden retriever trots happily by her side. A hot air balloon floats in the sky, its vibrant colors contrasting the clear blue. She takes a sip from her coffee cup, a moment of calm in an otherwise busy day. The sun’s warmth on her face matches the smile she wears as she watches her dog tug playfully at the leash. It’s a perfect autumn day."
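If you like to tinker, the guide above can also be pictured as a lookup table: a short spoken phrase goes in, a fuller request comes out. The sketch below is a rough illustration under some assumptions. The spoken phrases are taken from the guide, but the expanded wording, the names `VOICE_COMMANDS`, `normalize`, and `expand_command`, and the 0.6 matching cutoff are all made up for this example. It uses Python's standard `difflib` module so that a slightly misheard phrase still finds the right command.

```python
import difflib

# Spoken phrases from the guide, mapped to a fuller request.
# The expanded wording is illustrative only (an assumption, not from any app).
VOICE_COMMANDS = {
    "who's in the image": "Describe each person: apparent age range, hair, clothing, and accessories.",
    "what are they doing": "Describe what the people are doing and any objects they are holding or using.",
    "where is this happening": "Describe the setting and background, indoors or outdoors.",
    "what's the lighting like": "Describe the lighting and any shadows or highlights.",
    "how does the scene feel": "Describe the overall mood of the scene and the people's demeanor.",
    "give me all the details": "Give a full description: people, clothing, activity, setting, lighting, and mood.",
    "what's something unique in the scene": "Point out any unusual or unique detail in the image.",
    "focus on a specific detail": "Describe one specific detail of the image more closely.",
    "what's the weather": "Describe the weather and time of day if they can be seen.",
    "tell me a story about this image": "Tell a short story based on what is visible in the image.",
}


def normalize(spoken):
    """Lowercase, straighten curly apostrophes, and drop other punctuation."""
    text = spoken.lower().replace("\u2019", "'")
    return "".join(ch for ch in text if ch.isalnum() or ch in " '").strip()


def expand_command(spoken, cutoff=0.6):
    """Return the fuller request for the closest known phrase, or None if nothing is close."""
    matches = difflib.get_close_matches(
        normalize(spoken), list(VOICE_COMMANDS), n=1, cutoff=cutoff
    )
    return VOICE_COMMANDS[matches[0]] if matches else None
```

Calling `expand_command("Whats the weather")` still lands on the weather request even with the apostrophe missing, while something unrelated returns `None` instead of a wrong guess. A lower cutoff is more forgiving of misheard words but more likely to pick the wrong command.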
Support AI For The Blind by contributing to their tip jar: https://tips.pinecast.com/jar/wespeak
This podcast is powered by Pinecast.