Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00010361.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup on the bed.

Visual question: Please tell me what this is.

Answers:

  1. peas
  2. can green pigeon peas
  3. peas
  4. canned peas
  5. gandules verdes
  6. canned beans
  7. pigeon pearl verde
  8. canned pigeon peas
  9. green peanuts
  10. food

Reasons why answers differ:

Image captions:

  1. a can of goya green pigeon peas with boxes in the background
  2. A can of pigeon peas in a kitchen area.
  3. A small can of Goya brand pigeon peas
  4. close up of a can of goya brand green pigeon peas
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 2: VizWiz_train_00002259.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A box of food in a container.

Visual question: What is the name, brand name, and flavor of this?

Answers:

  1. gerber graduates lil entrees macaroni cheese
  2. gerber graduates toddler meals macaroni cheese peas carrots
  3. gerber graduates mac cheese
  4. gerber graduate macaroni cheese peas carrots
  5. gerber graduate toddlers macaroni cheese
  6. unanswerable
  7. gerber graduates toddler macaroni cheese peas carrots
  8. gerber graduates macaroni cheese peas carrots
  9. gerber graduates corn baby food
  10. graduates for toddlers gerber macaroni cheese

Reasons why answers differ:

Image captions:

  1. A box of baby food has macaroni on it
  2. A box of gerber toddler food is on a baby seat.
  3. A container of children's macaroni and cheese to feed a hungry belly.
  4. Gerber Graduates lil'entrees macaroni and cheese with peas and carrots.
  5. some sort of baby food that is by gerber baby

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_val_00003547.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that has a sticker on it.

Visual question: What is it?

Answers:

  1. coca cola 0 can
  2. ergerg
  3. coca cola 0
  4. coca cola
  5. coke 0
  6. coca cola 0 can
  7. coke 0
  8. coca cola 0
  9. coca cola 0
  10. coke 0

Reasons why answers differ:

Image captions:

  1. A can of Coca-Cola zero sitting next to a laptop computer and cell phone
  2. A can of diet coca cola is placed next to a gray colored laptop.
  3. A can of zero coca cola sitting next to part of a keyboard and screen of a laptop on the left and a part view of a cell phone to the right.
  4. A computer screen, a keyboard and a can of coca cola are on a table.
  5. A red, white and black soda can is sitting next to a laptop computer, on a beige work top.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 4: VizWiz_train_00009916.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A computer monitor on top of a keyboard.

Visual question: Hello there. What is happening on this computer screen, please? Thank you.

Answers:

  1. unsuitable
  2. reboot
  3. unanswerable
  4. computer crash
  5. system failure
  6. windows
  7. unsuitable
  8. unsuitable
  9. asking if you want to restore using system restore restore left cancel right
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A laptop with blue background and two gray pop up windows.
  2. appears to be a picture of a computer screen
  3. Computer monitor with a pop up screen that says your computer was not able to start, and asking whether you wish to restore your computer or cancel.
  4. Dialog screen of startup repair on a computer
  5. The computer monitor has a grey popup box with commands displayed.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_train_00003444.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white blanket in a bed.

Visual question: What color is this shirt?

Answers:

  1. pink
  2. white
  3. white
  4. pink
  5. light pink
  6. pink
  7. pink
  8. pink
  9. white
  10. pink

Reasons why answers differ:

Image captions:

  1. A button down shirt bodice with a pocket.
  2. A light pink or white pillow with pink trim.
  3. A person wearing a button up shirt and a wire coming down their body.
  4. CORNER AND EDGE OF A PINK BEDDING SHEET
  5. I believe this might be a heating pad under a pink pillow of sheet.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 6: VizWiz_val_00002729.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A vase with blue and white flowers on it.

Visual question: is this

Answers:

  1. unanswerable
  2. unanswerable
  3. bag
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. yes
  8. unanswerable
  9. unanswerable
  10. blue

Reasons why answers differ:

Image captions:

  1. a white box that is on a green flower rug
  2. Blue floral fabric encased in a clear plastic bag.
  3. Cloth with blue roses and a dotted pattern.
  4. It appears to be a comforter or end of a couch with a plastic covering the object has blue flowers and blue dots as designs.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 7: VizWiz_train_00006514.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a box of food on the floor.

Visual question: What kind of weiners are these?

Answers:

  1. butterball turkey franks
  2. butterball turkey franks
  3. turkey
  4. turkey
  5. turkey
  6. butterball turkey franks
  7. turkey franks
  8. turkey franks
  9. butterball
  10. turkey

Reasons why answers differ:

Image captions:

  1. A package of Butterball brand turkey franks hot dogs.
  2. A package of turkey hot dogs that are lactose free.
  3. A white hot dog package laying on a counter top
  4. An unopened package of Butterball Turkey Franks resting on a grainy surface.
  5. blue and beige colored bag of turkey hot dogs on a beige countertop

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00014278.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a couch with a pillow.

Visual question: What color is the shirt?

Answers:

  1. pink
  2. grey
  3. white
  4. tan
  5. eggshell
  6. paper
  7. grey
  8. white
  9. white
  10. off white

Reasons why answers differ:

Image captions:

  1. A wrinkled tan colored piece of fabric is shown.
  2. it appears to be a white fabric like a thin blanket
  3. Possible rock with cracks on it and a black background.
  4. Quality issues are too severe to recognize visual content.
  5. white blanket sheet hanging up with a blue carpet floor

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 9: VizWiz_train_00022095.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a bottle of diet peach pure leaf brand iced tea
  2. A half full bottle of peach iced tea on a desk.
  3. A half-empty bottle of Pure Leaf brand diet peach iced tea sits on a work desk.
  4. A plastic bottle with tea inside it has some kind of writing on it is on top of a desk and another plastic bottle and a lamp stand is behind it in the background.
  5. Here you have a picture of a bottle of Pure Leaf real brewed tea in diet peach about half full on a wood surface desk with other things including a Bottle of Mountain Dew and a desk lamp behind it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00022582.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bag of coffee that is silver is on the table.
  2. A metallic silver bag containing Brazilian coffee particles.
  3. A silver bag of Brazilian Roasted Coffee Beans
  4. A silver bag of coffee beans with a white label depicting the coffee beans inside.
  5. a silver bag of specially roasted coffee from Another Coffee

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_train_00012044.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of scissors.

Visual question: What's in the picture?

Answers:

  1. sardines can
  2. fish
  3. pink salmon
  4. can salmon
  5. canned salmon
  6. salmon
  7. salmon
  8. can fish
  9. salmon
  10. canned salmon

Reasons why answers differ:

Image captions:

  1. a can of double brand wild caught Alaskan salmon
  2. A hand holding a can of fish with other food products in the background.
  3. A red can containing pink salmon caught in the wild.
  4. A red labeled tin can of pink salmon is unopened in someone's hand.
  5. Canned pink Salmon caught in the wild in Alaska.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 12: VizWiz_train_00017954.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s hand.

Visual question: What is that?

Answers:

  1. belly button
  2. navel
  3. belly button
  4. bellybutton
  5. belly button
  6. stomach
  7. belly button
  8. belly button
  9. someones belly
  10. baby bump

Reasons why answers differ:

Image captions:

  1. A person's belly button is shown and they are fat
  2. An image of someone's belly and belly button.
  3. Belly button of a human beings abdominal area
  4. Fingers resting on a bare upper torso several inches above the belly button.
  5. Two fingers, a belly, and a belly button.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 13: VizWiz_val_00002324.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What is in this can?

Answers:

  1. unanswerable
  2. unanswerable
  3. spaghetti
  4. food
  5. unanswerable
  6. unsuitable
  7. spaghetti
  8. unanswerable
  9. tomato sauce
  10. spaghetti meatballs

Reasons why answers differ:

Image captions:

  1. A can of food has a nutritional label and is yellow.
  2. A can of food lying on its side with a portion of the product image and nutrition facts sections of the label visible.
  3. a tin food can with a yellow label and a nutritional label
  4. A yellow can with a nutrition label on it.
  5. The side of a can laying on its side with only part of label showing.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00000011.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a video game.

Visual question: What is the sodium content of this can of food?

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. unsuitable
  7. unsuitable
  8. unanswerable
  9. insufficient photo quality
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A person holding a food can with the rear of the label facing forward.
  2. A White man's hand holding a Vitamin Bottle.
  3. canned food held by a man's fingers with the thumb visible
  4. imagine how you would describe this image on the phone to a friend.
  5. the photographer's hand holding a round bottle with a black lid.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 15: VizWiz_train_00003593.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a brown wall on the floor.

Visual question: What is the color?

Answers:

  1. tan
  2. tan
  3. tan
  4. tannish pink
  5. tan
  6. tan
  7. brown
  8. brown
  9. brown
  10. tan

Reasons why answers differ:

Image captions:

  1. A brown colored cloth is made of suede,
  2. a tannish brown fabric with light creasing in the middle
  3. A yellow piece of fabric with a line running through it
  4. Quality issues are too severe to recognize visual content.
  5. Yellow shag blanket or fleece blanket or wrinkled carpet.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 16: VizWiz_train_00014862.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a bed.

Visual question: What kind of candy is this?

Answers:

  1. peanut butter egg
  2. peanut butter egg
  3. peanut butter egg
  4. peanut butter eggs
  5. chocolate peanut butter
  6. peanut butter egg
  7. peanut butter egg
  8. peanut butter egg
  9. peanut butter egg
  10. peanut butter egg

Reasons why answers differ:

Image captions:

  1. A package of a peanut butter egg on the lap of a person.
  2. A person has a candy egg in their lap.
  3. A white box containing a peanut butter egg, placed upon the lap of someone wearing denim shorts.
  4. Man in denim shorts holding an unopened peanut butter egg
  5. Multi-colored box, appears to be an Easter chocolate, reads: Peanut butter egg.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_train_00011271.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white surface with an umbrella.

Visual question: What is this?

Answers:

  1. shirt
  2. pants
  3. unsuitable
  4. khaki pants
  5. unanswerable
  6. clothing
  7. towel
  8. unsuitable
  9. clothes
  10. tan material

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The pocket of a piece of clothing with several lines of stitching.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 18: VizWiz_train_00016486.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: What's in this box?

Answers:

  1. juice
  2. sophie juice
  3. sophie juice
  4. unanswerable
  5. charger
  6. charger
  7. sophie juice
  8. juice
  9. sophie juice
  10. charger

Reasons why answers differ:

Image captions:

  1. A package that has a picture of a battery on it and the text mophie juice.
  2. A person's hand is shown in front of a control panel for a juice processor.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_val_00004141.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding up a sign with some dogs.

Visual question: What is this?

Answers:

  1. potatoes
  2. box betty crocker roasted garlic cheddar mashed potatoes
  3. instant potatoes
  4. roasted garlic cheddar mashed potatoes
  5. instant potatoes
  6. roasted garlic cheddar instant potatoes
  7. mashed potatoes
  8. mashed potatoes
  9. roasted garlic cheddar mashed potatoes
  10. potatoes

Reasons why answers differ:

Image captions:

  1. a food which is used as a topping for food contains potato, garlic and cheese
  2. A red box of Betty Crocker brand flavored instant potato flakes.
  3. A red box of Betty Crocker mashed potatoes held in a person's left hand.
  4. a red paper package box of Betty Crocker Roasted Garlic & Cheddar
  5. An open box of Betty Crocker roasted garlic potatoes

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 20: VizWiz_train_00013944.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A coffee cup sitting on top of a table.

Visual question: What's in this can?

Answers:

  1. whole kernel corn
  2. low sodium whole kernel corn
  3. corn
  4. corn
  5. supersweet whole kernel corm
  6. supersweet whole kernel corn
  7. whole kernel corn
  8. corn
  9. corn
  10. whole kernel corn

Reasons why answers differ:

Image captions:

  1. 15 point 25 ounce can of Hart Brand low sodium super sweet whole kernel corn
  2. A can of low sodium super sweet whole kernel corn that is upside down on a countertop.
  3. a can of sweet corn on a counter top
  4. A can of whole kernel corn sits upside down on a white countertop.
  5. Pictured is a can of low sodium super sweet whole kernel corn.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_train_00011751.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: I've got two kinds of cheese, American and Cheddar. Which one is this?

Answers:

  1. unsuitable
  2. unsuitable
  3. cheddar
  4. american
  5. cheddar
  6. unsuitable
  7. cheddar
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a food product is wrapped in a plastic wrapper.
  2. A vacuum sealed package of fresh cheese on a marble table top.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. something wrapped in a tight plastic bag with a price tag and the box of ingredients.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 22: VizWiz_train_00006648.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: What box is this?

Answers:

  1. green tea
  2. tea
  3. teabags
  4. tea
  5. green tea
  6. jasmine green tea
  7. foojoy jasmine green tea bags
  8. tea
  9. tea bag box
  10. tea

Reasons why answers differ:

Image captions:

  1. A box of Jasmine green tea is sitting on the counter top.
  2. A box of jasmine tea laying on a dark wood surface.
  3. A green box of FooJoy Jasmine Green Tea with white letters.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_val_00003333.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a piece on a table.

Visual question: What is this?

Answers:

  1. unsuitable
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. unsuitable
  6. paper
  7. unsuitable
  8. paper
  9. appears to be piece paper maybe receipt
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a paper object is on top of a wooden surface.
  2. A white piece of paper with a few scribblings on a wood surface.
  3. An up close photo of a sheet folded and sitting on a table.
  4. Quality issues are too severe to recognize visual content.
  5. worn out piece of paper on the edge of a table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 24: VizWiz_val_00000863.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What card is this?

Answers:

  1. unanswerable
  2. red credit card
  3. unsuitable image
  4. unsuitable image
  5. unanswerable
  6. credit card
  7. unanswerable
  8. unsuitable image
  9. unsuitable image
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a picture of the back of a credit card.
  2. A plastic debit card on its swiping strip side.
  3. Red credit card with blank signature line and black strip showing.
  4. the back of a credit or debit card on a counter
  5. The back of a red credit card resting on a table

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 25: VizWiz_train_00017256.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A keyboard that is sitting on a wall.

Visual question: What is this device?

Answers:

  1. external hard drive
  2. keyboard
  3. unsuitable
  4. keyboard
  5. keyboard
  6. unsuitable
  7. keyboard
  8. keyboard
  9. unanswerable
  10. this keyboard

Reasons why answers differ:

Image captions:

  1. a black computer keyboard with white characters and a metal slot
  2. A piece of specialized equipment attached to a computer keyboard.
  3. On a fabric surface is a keyboard and grey tray.
  4. part of a keyboard sitting on a rug
  5. Some sort of partial keyboard with a metal piece coming out of the top.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 26: VizWiz_train_00009910.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture with a brown couch on the floor.

Visual question: What is this type of design on my shirt?

Answers:

  1. i dont know
  2. 0
  3. no design
  4. stripes
  5. solid color
  6. 0
  7. plain
  8. sofa
  9. light brown no design
  10. 0

Reasons why answers differ:

Image captions:

  1. A brown fabric or carpet with two small stains on it
  2. A brown knitted fabric with two small wet spot near the center.
  3. A brown piece of fabric with a few stains.
  4. A piece of fabric with discolored stains on it.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 27: VizWiz_train_00015196.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of wine sitting on a table.

Visual question: What is this?

Answers:

  1. unsuitable
  2. unsuitable
  3. bottle
  4. unsuitable
  5. bottle
  6. dont know
  7. lotion
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A body care product in a white bottle on a brown table
  2. A bottle of lotion on a brown surface in front of napkins and a phone.
  3. a small white bottle with a green label sitting on a table
  4. A white plastic bottle on top of a wooden desk
  5. Pictured is a small hotel brand bottle of shampoo or conditioner.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_train_00003511.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a toilet in a room.

Visual question: What's this?

Answers:

  1. unanswerable
  2. unsuitable
  3. speaker
  4. unanswerable
  5. speaker
  6. unsuitable
  7. speaker
  8. speaker
  9. speaker
  10. speaker

Reasons why answers differ:

Image captions:

  1. A small speaker set on a speaker stand.
  2. Part of an older CRT television screen, a white audio speaker and an unidentified item made of metal and faux wood.
  3. partial image of computer monitor speaker and a trophy
  4. Right side of a silver TV with speaker showing
  5. Television small speaker and a small art piece on the right.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00015826.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a cell phone in their hand.

Visual question: Expiration date on this.

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. An Oscar Mayer Lunchables Basics Turkey and Cheddar .
  2. Ingredients for Turkey and Cheddar lunch kit from the Kraft brand.
  3. Quality issues are too severe to recognize visual content.
  4. the back of a container of food showing nutritional information
  5. the ingredients on the back of a small lunchable

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 30: VizWiz_train_00009435.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall with a sign.

Visual question: Describe the screen.

Answers:

  1. microsoft windows loading screen
  2. blank
  3. unanswerable
  4. black horizontal white line
  5. blank word microsoft
  6. white line through middle
  7. black white lines writing
  8. loading
  9. dark horizontal white line
  10. black line across center loading image at bottom

Reasons why answers differ:

Image captions:

  1. a black screen of some sort with a word and loading bar
  2. A computer screen shows Microsoft windows booting up
  3. A computer screen with a thin solid line horizontally across the middle as well as a progress bar of loading content at the bottom.
  4. Black laptop screen with loading Microsoft logo with power button.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 31: VizWiz_train_00017209.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bed that is sitting on the floor.

Visual question: Can you tell me what this is? I hope I took the picture right, and thank you this is a wonderful application.

Answers:

  1. table
  2. bed
  3. stuff on table
  4. sideways picture bed table top
  5. unanswerable
  6. unanswerable
  7. napkin holder
  8. dining table
  9. table
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A bedside tray with mail and possibly an alarm clock.
  2. A fruit table linen with a letter holder on top of it.
  3. Table top with a cover that has pictures of fruit and objects on a table
  4. table with floral tablecloth with an organizer sitting on it holding lots of miscellaneous items
  5. The top side of a table with a fruit sheet over it and a toaster on it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 32: VizWiz_train_00021037.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A photo of a door knob with the handles and lock on a white door
  2. An open door with a golden handle and silver screws in it
  3. An open white door and brass door handle.
  4. Quality issues are too severe to recognize visual content.
  5. White door with a bronze handle that is next to a wall.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 33: VizWiz_train_00006227.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a toothbrush on a green surface.

Visual question: What game is this?

Answers:

  1. video game
  2. unanswerable
  3. unanswerable
  4. car
  5. not sure
  6. driving
  7. grand theft auto
  8. grand theft auto
  9. driving game
  10. racing game

Reasons why answers differ:

Image captions:

  1. A blue car in the video game Grand theft auto series.
  2. a TV screen showing a car in a grand theft auto video game
  3. A TV screen with a video game picture on it of a blue car.
  4. screen with blue car on a green background
  5. The monitor screen show the blue car with green background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 34: VizWiz_train_00000145.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a refrigerator on a table.

Visual question: What is this tablet please?

Answers:

  1. folic acid
  2. rotate to left unable to see label
  3. folic 800
  4. folic acid
  5. unsuitable
  6. unsuitable
  7. folic acid
  8. folic acid
  9. unsuitable
  10. supplement

Reasons why answers differ:

Image captions:

  1. a bottle of folic acid pills on a wood grain table
  2. A bottle of medicine laying on a wooden table
  3. a plastic container of some kind of food supplement
  4. A white bottle of vitamins with Folic and 800 as well as a bar code
  5. A white bottle with an orange label with black text written on it and the bottle is laying on its side.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 35: VizWiz_train_00015030.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A pair of scissors on a wooden surface.

Visual question: What is it?

Answers:

  1. nailclippers
  2. nailclipper
  3. clippers
  4. nail clipper
  5. nail clippers
  6. nail clippers
  7. nail clipper
  8. nail clippers
  9. nail clipper
  10. nail clipper

Reasons why answers differ:

Image captions:

  1. a pair of large toenail clippers on its side on a smooth wood surface.
  2. A side view of metal toe clippers on a wooden surface.
  3. A silver nail clipper in closed position on a wooden surface.
  4. The side view of a nail clipper lying on a wooden surface.
  5. Toenail clippers sit on its side on a wooden table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 36: VizWiz_train_00011567.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee is on a table.

Visual question: What's the picture on this cup?

Answers:

  1. santa snowflakes
  2. santa
  3. santa
  4. santa
  5. santa
  6. santa
  7. santa
  8. santa clause
  9. cartoon santa snowflakes
  10. santa

Reasons why answers differ:

Image captions:

  1. A green and white coffee cup with a jolly red Santa.
  2. a green and white coffee mug with Santa Claus on it
  3. A mug with a Santa Claus cartoon print on a green background
  4. a white and green coffee mug with Santa and snowflakes on it
  5. Christmas designs decorate this coffee mug on the table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 37: VizWiz_val_00002414.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a laptop on a bed.

Visual question: What's on the screen?

Answers:

  1. screen black
  2. nothing
  3. nothing
  4. nothing
  5. nothing
  6. triangle
  7. reflection
  8. nothing
  9. nothing
  10. screen blank

Reasons why answers differ:

Image captions:

  1. a black DELL laptop computer placed on the floor
  2. A laptop computer with snaking cords is sitting on a work surface.
  3. A laptop is on a desk and there are wires around it
  4. A view of a black laptop that is not powered on.
  5. An open laptop that does not look like it's turned on with assorted cords to the left of it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 38: VizWiz_val_00004632.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A can of corn is sitting on the counter.
  2. A food item tin which is sealed is seen and nearby it there are few items
  3. A tin can of whole kernel corn is on a laminate table.
  4. A tin can with a white label sitting on a wooden table with metal edges.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00018393.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee is sitting on a table.

Visual question: What is this?

Answers:

  1. diet coke
  2. coke caffeine free
  3. can soda
  4. pop can
  5. diet coke
  6. diet coke
  7. soda
  8. unanswerable
  9. cola
  10. caffeine free diet coke

Reasons why answers differ:

Image captions:

  1. a can bottle of CAFFEINE FREE COKE on a plane surface
  2. A gold and red Caffeine Free Coke stands on a table with some electronics cords and styrofoam cups.
  3. A silver can of diet coke, sitting on a desktop next to a CD spindle and Styrofoam cups.
  4. an open can of caffeine free diet coke on a desk
  5. Pictured is a can of diet caffeine free coke on a desk.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_val_00007104.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up photo of some ground beef wrapped on a red plate
  2. A package of ground beef is on the plate in front of you.
  3. A wrapped item is placed in a plate or a red basin.
  4. a wrapped package of ground beef on a round, red plate
  5. An unknown bag of some sort with lettering on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00007304.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is holding a bottle of milk.

Visual question: Is this my eye medication?

Answers:

  1. yes
  2. yes
  3. no
  4. yes
  5. yes
  6. yes
  7. brimonidine tartrate
  8. yes
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. A bottle of medicine shows its active ingredients
  2. a brimonidine tartrate solution being held by someone
  3. A hand holding a bottle containing Brimonidine Tartrate.
  4. A hand holding a bottle of brimonidine tartrate eye drops.
  5. A hand is holding a white bottle of eye drops with a purple lid.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00003942.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a jar of food next to a cup.

Visual question: What is it?

Answers:

  1. peanut butter
  2. jar peanut butter
  3. peanut butter
  4. peanut butter
  5. peanut butter
  6. peanut butter
  7. peanut butter
  8. 1 jar creamy salted peanut butter
  9. peanut
  10. peanut butter

Reasons why answers differ:

Image captions:

  1. a container of peanut butter with the lid halfway off
  2. A jar of peanut butter with the lid partly ajar, a dirty knife lies in front of it.
  3. A slightly open jar of creamy salted peanut butter with a black plastic knife in front of the jar.
  4. Plastic clear container with a white top that has peanut butter
  5. Plastic jar of cream salted peanut butter from unblanched peanuts.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00013927.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A white refrigerator with a sign on it.

Visual question: what number is on the digital display?

Answers:

  1. 475
  2. 4.75
  3. 4.75
  4. 4.75
  5. 475
  6. 4.75
  7. 4.75
  8. 4.75
  9. 475
  10. 4.75

Reasons why answers differ:

Image captions:

  1. A machine called VTM basic value adder for reloading.
  2. A machine for adding value to a money card.
  3. a machine of transferring money most likely for credit cards
  4. A picture of a VTM with the instructions labeled on the front
  5. A VTM basic value adder with a digital display with red numbers of 475.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 44: VizWiz_train_00008285.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of the side of a metal meter.

Visual question: What is this?

Answers:

  1. sink
  2. unanswerable
  3. oven
  4. solar powered ipod iphone dock
  5. unanswerable
  6. unanswerable
  7. oven
  8. iphone dock
  9. eton ipod sound system
  10. stereo

Reasons why answers differ:

Image captions:

  1. A black object with the letters "eton" across the top and a hole in the front with what looks like speakers on the sides and icons showing batteries and a sun
  2. A screen shows an unknown image on some kind of electronic device.
  3. A speaker system is displayed and is black in color.
  4. Quality issues are too severe to recognize visual content.
  5. Some type of docking station by a Ihome clock radio.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 45: VizWiz_train_00006734.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A brown teddy bear sitting on a couch.

Visual question: What is this?

Answers:

  1. breakfast sandwich
  2. breakfast croissant sandwhiches
  3. breakfast sandwich
  4. croissants
  5. sausage egg cheese breakfast croissant
  6. sandwiches
  7. sausage egg cheese on croissant
  8. sandwich
  9. croissant sandwiches
  10. breakfast sandwiches

Reasons why answers differ:

Image captions:

  1. A box of croissant sandwiches filled with sausage, egg and cheese.
  2. A close up of a box of croissant sandwiches.
  3. Partial image of a package of croissant sandwiches that look as if they may be breakfast sandwiches
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 46: VizWiz_train_00022965.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a device which is used for typing and a tube contains cream
  2. A tube of toothpaste and a keyboard on a table
  3. A white plastic tube next to a keyboard
  4. Back side of tube of perhaps toothpaste sitting near a computer keyboard.
  5. the photo is of a tube of paste in front of a computer keyboard.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_train_00020937.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A man holding a chick-fil-a box in front of his meal.
  2. A person is holding a packet of chick-fil-a food
  3. A person is holding a red, fast food container.
  4. Chick fil-A red fries container held in hand.
  5. Red fry box from chick fil a that has check written in white.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_val_00001891.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s hand.

Visual question: What's inside this image?

Answers:

  1. leg foot
  2. toes
  3. leg foot floor rug
  4. leg
  5. foot
  6. leg
  7. feet person rug
  8. 2 feet 1 leg 1 rug
  9. leg
  10. leg

Reasons why answers differ:

Image captions:

  1. A leg with white pants and two feet are visible
  2. A person in white pants is extending out their leg.
  3. A person is sitting on a designer rug.
  4. A person is wearing white pants and has their leg stretched out on a rug,
  5. A person sitting on the ground with their foot extended out in front of them.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 49: VizWiz_train_00007853.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A pair of shoes sitting on top of a table.

Visual question: Are these shoes acceptable for springtime?

Answers:

  1. no
  2. yes
  3. no
  4. no
  5. yes
  6. yes
  7. yes
  8. yes
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. A pair of black leather low boot sitting on brown table.
  2. A pair of boots on a piece of paper a wooden table
  3. A pair of dress boots on a table runner.
  4. a pair of shoes on a table runner which is on a wooden table with a couple of chairs
  5. Pair of brown boots sitting on a table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_val_00000779.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What's this?

Answers:

  1. laptop
  2. macbook
  3. keyboard
  4. laptop computer
  5. macbook
  6. keyboard laptop computer
  7. laptop computer
  8. mac app store
  9. lap top
  10. laptop

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor is displaying blind memory on it's screen.
  2. A laptop with the keyboard and a web page pulled up on the screen.
  3. a MacBook computer laptop sitting with the screen open and on.
  4. A MacBook computer screen with information about Blind Memory app on the screen.
  5. a picture of a MacBook laptop on a blind memory screen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.