Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00008928.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A box of food sitting on a wall.

Visual question: What is this?

Answers:

  1. this food item
  2. frozen potatoes
  3. hash browns
  4. ore ida
  5. hasbrowns
  6. hash browns
  7. frozen potatoes
  8. frozen hashbrowns
  9. freida hash browns
  10. ore ida hash browns

Reasons why answers differ:

Image captions:

  1. A bag of hash browns are laying on the floor.
  2. a bag of Ore Ida frozen hash browns laying on a tile floor
  3. A frozen red package of hash browns is on the tile floor.
  4. Bag of frozen hash browns on a tiled floor.
  5. Unopened bag of Ore Ida hash browns on a tiled background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_val_00004077.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A small clock on a wooden wall.

Visual question: What is that?

Answers:

  1. clock
  2. clock
  3. clock
  4. wall clock
  5. clock
  6. clock
  7. clock
  8. clock
  9. clock
  10. clock

Reasons why answers differ:

Image captions:

  1. A clock with a bell attached to it is hanged on the wall
  2. a large brown wooden wall clock reading 9:07
  3. A vintage clock on a white wall and the time shows 9:10
  4. A wooden wall clock with a red second hand and an hour chime.
  5. A wooden-framed wall clock with gold trim reads nine-o-seven.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 3: VizWiz_train_00016919.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A couple of food that are next to each other.

Visual question: What is this?

Answers:

  1. food
  2. roast lamb
  3. lamb patties
  4. 4 lamb quater pounders
  5. food
  6. lamb burgers
  7. package lamb
  8. 4 lamb quarter
  9. lamb patties
  10. lamb patties

Reasons why answers differ:

Image captions:

  1. A box of 4 pre-made packaged lamb burgers that are lying on a carpeted floor.
  2. a box of four lamb quarter pound burgers for 2 quid
  3. a package of 4 lamb patties cost about 2 pounds
  4. A package of frozen beef patties is on the floor.
  5. Four ground lamb patties in a rectangle package with a yellow label and a picture of the meat on the front

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 4: VizWiz_val_00000679.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What did that say?

Answers:

  1. unsuitable image
  2. unsuitable image
  3. religious school paper
  4. unanswerable
  5. unsuitable image
  6. unsuitable image
  7. unanswerable
  8. unsuitable image
  9. unsuitable image
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a book or novel that you can read to learn stuff
  2. A church program for the Feast of Ascension.
  3. A document with at least two paragraphs on it and bold lettering at the top.
  4. A page of text, white paper and black text.
  5. Pictured is an excerpt from a book or magazine.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_train_00019027.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a bottle on the ground.

Visual question: What is this?

Answers:

  1. unanswerable
  2. unsuitable
  3. lotion
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. moisturizing conditioner
  8. unanswerable
  9. unanswerable
  10. loation

Reasons why answers differ:

Image captions:

  1. A red tube of a certain soap or facial creme.
  2. A small plastic bottle laying on its side on a cushion.
  3. An orange tube of moisturizing conditioner on an off white cloth surface.
  4. Brown packaging with white writing for plastic conditioner on a tan blanket.
  5. The back of a bottle of moisturizing conditioner laying on a bed.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00014849.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on a table.

Visual question: What is this?

Answers:

  1. wendys beverage cup
  2. wendyäó»s cup
  3. wendys cup
  4. wendys drink
  5. wendys to go cup
  6. wendys cup
  7. large cup
  8. wendys fountain rink
  9. wendys drink cup
  10. realfood

Reasons why answers differ:

Image captions:

  1. a container/ box / bottle that contains liquid / goods.
  2. A cup from Wendy's is on a table along with a cell phone.
  3. a Wendy's cup on a table with the tray and an iPhone beside it
  4. A white and red soda cup from Wendy's restaurant.
  5. Large soft serve drink from Wendy's restaurant

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00014529.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pink and white door.

Visual question: What color is it?

Answers:

  1. pink
  2. pink
  3. pink
  4. pink
  5. pink
  6. pink white trim
  7. pink
  8. pink
  9. pink
  10. pink

Reasons why answers differ:

Image captions:

  1. A pink painted wall between two doorways behind an air grate in a wooden floor.
  2. A pink wall a wooden floor with an air vent
  3. a pink wall in someone's home next to a door
  4. A smaller wall with an opening to the left and a closet door to the right of it.
  5. A very pink wall is next to the open closet

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 8: VizWiz_train_00005670.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bowl of food.

Visual question: Can you tell me the label on this meat pack what it is? Thank you.

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. no
  8. unanswerable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A white cloth is on top of a surface in a dark area.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 9: VizWiz_train_00023928.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Coffee come from.

Answers:

  1. japan
  2. china
  3. beans
  4. ucc coffee
  5. ucc
  6. japan
  7. unsuitable image
  8. unanswerable
  9. unanswerable
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 10: VizWiz_val_00004291.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is the expiration date?

Answers:

  1. too blurry to see
  2. unsuitable
  3. 12 jan 27
  4. unsuitable
  5. aug 2012
  6. may 12 2017
  7. 2012
  8. unanswerable
  9. 2012
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A microwavable entree is on top of a table.
  2. A plastic microwaveable container with ready to eat food with a directions label.
  3. An colorful package of an item laying sideways.
  4. Brightly colored packaging from a laundry product and part of a green plastic lid, sitting on a wooden desk next to a printer.
  5. I see the back of product packaging with instructions for cooking.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 11: VizWiz_train_00007725.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s hand.

Visual question: Please name this plant.

Answers:

  1. leaf
  2. leaf
  3. maple leaf
  4. unanswerable
  5. fern
  6. weed
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. white oak

Reasons why answers differ:

Image captions:

  1. A human hand holds a green leaf against a white piece of paper.
  2. A person holding a green leaf on a white sheet of paper.
  3. A PERSON HOLDING A SHEET OF PAPER AND A LEAF
  4. someone holding a piece of leaf in their hands
  5. White piece of paper with a green leaf on it being held by someone.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 12: VizWiz_train_00002670.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book on the wall.

Visual question: What does this say?

Answers:

  1. presentation techniques protocol
  2. panelists training undergraduate admissions
  3. panelists training undergraduate admissions
  4. panelist training undergraduate admissions
  5. panelists training
  6. training
  7. panelists training undergraduate admissions presentation techniques protocol
  8. information
  9. presentation techniques protocol
  10. panelists training

Reasons why answers differ:

Image captions:

  1. A clipped paper packet has a training manual printed onto the pages.
  2. A grouping of papers that are paper clipped together.
  3. Educational related for panelists training undergraduate admissions notice is here.
  4. some sort of panel piece of paper that is yellow
  5. yellow paper of panelist training undergraduate admissions name on it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 13: VizWiz_train_00004803.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a plate of food on the table.

Visual question: What is this?

Answers:

  1. food
  2. beef merlot frozen dinner
  3. healthy choice beef merlot
  4. frozen dinner
  5. box dinner
  6. healthy choice beef merlot
  7. stew
  8. beef merlot frozen dinner
  9. healthy choice beef merlot meal
  10. healthy choice beef merlot

Reasons why answers differ:

Image captions:

  1. A box of food is on a wood table
  2. A box of frozen healthy choice is on the edge of a table.
  3. A box of healthy choice beef merlot on a table
  4. A frozen dinner meal healthy choice, that displays steak and vegetables in a bowl.
  5. A ready to microwave meal from Healthy Choice with green and white packaging.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00007958.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a frisbee.

Visual question: What is this?

Answers:

  1. crackers
  2. crackers
  3. crackers
  4. package crackers
  5. crackers
  6. crackers
  7. crackers
  8. crackers
  9. crackers
  10. crackers

Reasons why answers differ:

Image captions:

  1. A container / package that contains various goods / edible / liquid items.
  2. A plastic wrapping containing cream flavored crackers held up.
  3. A yellow and red package of sweet crackers.
  4. In this picture is a image of a pack of snack
  5. Part of a crackers package is being held up by a person's hand, and there appears to be an office in the dark hazy background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 15: VizWiz_train_00007024.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a white tree.

Visual question: What are these pills?

Answers:

  1. midol
  2. unsuitable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. motrin
  7. unanswerable
  8. melatonin
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A bottle lays down on top of a rug with colorful flowers' design.
  2. a medicine bottle with purple label and cap just off the side of image
  3. A purple bottle of melatonin supplements is on a white doilies mat on the table.
  4. Black bottle with purple lid laid on a kitchen table placemats
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 16: VizWiz_val_00002384.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cell phone on a table.

Visual question: What is in this little bottle?

Answers:

  1. nothing
  2. unanswerable
  3. unanswerable
  4. jelly
  5. unanswerable
  6. lotion
  7. cleaner
  8. unsuitable
  9. vodka
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a dark bottle on a wood table surface
  2. A dark color blue bottle has a label and sits on a wooden surface.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 17: VizWiz_train_00005101.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a brick wall with a bunch of it.

Visual question: what color is this?

Answers:

  1. brown
  2. maroon
  3. red
  4. burgundy
  5. red
  6. handbag
  7. burgundy
  8. red
  9. brown
  10. brown

Reasons why answers differ:

Image captions:

  1. A red leather purse has also a stitched in portion of alligator skin.
  2. dark red half circle, and light red and white edged squares below the half circle
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Red leather bag with crocodile prints on bottom.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 18: VizWiz_train_00016362.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle on a table.

Visual question: What kind of creamer is this?

Answers:

  1. coffeemate hazelnut
  2. hazelnut
  3. hazelnut
  4. hazelnut
  5. nestle
  6. hazelnut
  7. hazelnut
  8. hazelnut
  9. coffee mate
  10. coffee mate

Reasons why answers differ:

Image captions:

  1. A bottle of coffee creamer is on the table.
  2. A bottle of coffee creamer next to a lunchbox.
  3. a bottle of coffee mate hazelnut coffee creamer with a red top sitting on a table.
  4. a bottle of coffee mate hazelnut flavored creamer
  5. A view of a liquid coffee creamer mixer next to a lunch bag

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_val_00005910.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A big dog is resting in a suspended bed, above the ground.
  2. A dog is sitting right on top of the bed in the room.
  3. a dog that is in some sort of canopy bed relaxing
  4. A golden retriever sleeping in the backseat of a car.
  5. A large dog sitting in a hammock behind the passenger seat of a car.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 20: VizWiz_train_00001053.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A pair of luggage sitting next to each other.

Visual question: Its not crashing.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. unsuitable
  7. unanswerable
  8. unanswerable
  9. no
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A wheel of a rolling chair and a box nearby it
  2. An OfficeMax tablet and the floor of an office area.
  3. Appears to be a product of OFFICE MAX.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_train_00013961.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What is this?

Answers:

  1. creamy mash potato
  2. potatoes
  3. mashed potatoes
  4. mashed potatoes
  5. mashed potatoes
  6. potatoes
  7. mashed potatoes
  8. mashed potatoes
  9. mash potatoes
  10. mashed potato

Reasons why answers differ:

Image captions:

  1. a label of CREAMY MASH, British Maris Piper potato
  2. A pack of sealed container containing British potatoes
  3. a package of creamy mashed potatoes that is from a freezer
  4. A package of potatoes is red and black packaging and a picture of the food product on the front sits on a hard light colored surface
  5. a white color paper showing some cream color food

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 22: VizWiz_val_00001577.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a bottle of food on the table.

Visual question: What's this? What's in the vegetables?

Answers:

  1. spaghettios
  2. spaghetti os
  3. spaghetti os
  4. spaghetti os
  5. can alphabet spaghettios
  6. spaghetti os tomato sauce
  7. unanswerable
  8. spaghetti os
  9. spaghettios
  10. spagetti

Reasons why answers differ:

Image captions:

  1. a can of Campbell's spaghetti-o's on someone's lap
  2. a cylindrical tin with Healthy Kids written on it
  3. A nutritional food label on a jar of Spaghetti O's
  4. a tin food can of Campbell's brand spaghettios
  5. An unopened can of spaghetti o's with shapes

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00018257.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a table.

Visual question: What is this?

Answers:

  1. wine
  2. book
  3. malbec wine
  4. wine
  5. malbec wine
  6. wine
  7. wine
  8. qwerd
  9. wine
  10. malbec

Reasons why answers differ:

Image captions:

  1. A bottle of 2010 Malbec Argentinian wine with brand Pascualtos rests on the lap of someone wearing blue jeans.
  2. a bottle of 2010 Malbec by Pascual Tos from Mendoza Argentina
  3. A bottle of red wine that has an orange and white label.
  4. Black wine bottle 2010 Malbec on human lap
  5. No driving now because there is a bottle of wine in your lap.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00005926.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book that is sitting on a table.

Visual question: What is this?

Answers:

  1. food
  2. unanswerable
  3. box
  4. unsuitable
  5. unanswerable
  6. soup mix
  7. unanswerable
  8. great value product
  9. ingredients list
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a back side of the packet of food item which describes nutritional values
  2. A package of a food item showing nutritional information
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The back side of a Walmart brand food product lying on a counter top.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 25: VizWiz_train_00006047.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plant with leaves.

Visual question: What is this?

Answers:

  1. leaves
  2. leaves on plant
  3. leaves
  4. leaves plant
  5. leaves
  6. leaves
  7. green leaves
  8. leaves
  9. prickly plant
  10. plant

Reasons why answers differ:

Image captions:

  1. a bunch of green leaves on a vine
  2. a combination of green leave and grey stem is displayed
  3. a picture showing the leaves in the forest from a top view section
  4. A shot of some leafy plants with sharp edges taken outside.
  5. Plant with semi dark green leaves that are in the shape of a heart.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 26: VizWiz_val_00000014.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What does my eye look like?

Answers:

  1. unsuitable image
  2. unanswerable
  3. unsuitable image
  4. unsuitable image
  5. crazy
  6. unsuitable image
  7. unsuitable image
  8. shiny
  9. unsuitable image
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bright blinding light is obstructing all view
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 27: VizWiz_train_00012453.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: what is this?

Answers:

  1. this sandal on ladys foot
  2. leg
  3. sandal on foot
  4. foot in sandal
  5. gold sandal
  6. sandles
  7. sandal
  8. sandal
  9. sandal
  10. shoe

Reasons why answers differ:

Image captions:

  1. A woman's foot in a sandal that has five gold straps.
  2. A woman's foot with red nail polish wearing a golden sandal, a greenish blue office chair and a golden coat.
  3. A woman's foot with red toe nail polish wearing a gold sandal.
  4. I am looking at a foot with red painted toenails wearing a bronze sandal.
  5. Red painted toenails with sandals are a bold statement, but that is what we are looking at here.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 28: VizWiz_train_00013413.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a yellow toy with a stuffed animal.

Visual question: What is this?

Answers:

  1. rubber chickie
  2. duck
  3. plastic toy turkey
  4. yellow turkey toy
  5. rubber duck
  6. rubber duck
  7. rubber duck
  8. duck
  9. rubber duck
  10. yellow turkey

Reasons why answers differ:

Image captions:

  1. A small green orange and pink kids toy.
  2. A yellow rubber duck with a pink and orange Mohawk, and something pink hanging from its peak.
  3. A yellow toy duck with a orange beak and pink and orange hair.
  4. Someone is hold a small yellow, pink and orange rubber duck toy.
  5. Yellow rubber ducky that has pink and orange spiked hair.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00013280.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: An image of a glass case in a room.

Visual question: What color is my fan?

Answers:

  1. black
  2. brown
  3. black brown
  4. black woodgrain
  5. tan
  6. brown black
  7. brown black
  8. silver brown
  9. brown black
  10. black brown

Reasons why answers differ:

Image captions:

  1. A brown and black portable heater with controls on the top sitting next to brown bricks.
  2. A piece of furniture with wood framing and a stainless steel inside.
  3. A space heater or movable fan with orange trim is standing next to a brick wall inside a room.
  4. A standing heater that has a fake wood finish.
  5. A Wind Curve tower fan with ingrained wood that functions as a fresh air ionizer.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 30: VizWiz_train_00005658.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dog sitting on the floor of a room.

Visual question: What color is the carpet?

Answers:

  1. tan
  2. beige
  3. tan
  4. beige
  5. beige cream
  6. tan
  7. beige
  8. beige
  9. white
  10. grey

Reasons why answers differ:

Image captions:

  1. A carpeted room with a TV and ceramic pit bulls in the corner.
  2. An empty living room with some wooden furniture and plastic dog statues in the background.
  3. Decorative puppy figures line the wall next to the television.
  4. Three dog statues in the left side top corner of the picture
  5. Three dog statues two are sitting and one is laying on his back with his feet in the air.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 31: VizWiz_train_00013180.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard with a laptop.

Visual question: What is this?

Answers:

  1. macbook pro
  2. laptop
  3. keyboard
  4. keyboard
  5. laptop
  6. laptop
  7. laptop keyboard
  8. laptop keyboard
  9. macbook pro laptop keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. a keyboard on a MacBook pro laptop computer
  2. A MacBook Pro is open and sitting on the desk.
  3. A MacBook Pro sideways with the keyboard displayed.
  4. A portion of a laptop keyboard with the keyboard reflected in the monitor.
  5. Up close picture of a MacBook Pro laptop keyboard.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 32: VizWiz_train_00017429.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is sitting on a table.

Visual question: What kind of coffee is this? Thank you.

Answers:

  1. unanswerable
  2. backet
  3. guatemalua anteguia
  4. unanswerable
  5. coffee am
  6. coffeeam
  7. guatemala antigua
  8. unsuitable
  9. coffee am
  10. cant tell writing too small

Reasons why answers differ:

Image captions:

  1. a package of coffee grounds on someone's counter
  2. A package of coffee is sitting on the counter.
  3. a silver bag of coffee in Guatemala antigua
  4. A silver coffee beans bag on top of a flat surface.
  5. The long silver bag has coffee in it, and is from Guatemala.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_train_00017000.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on a counter.

Visual question: What flavor is this can of soup? Thank you.

Answers:

  1. can soup
  2. cream chicken
  3. cream chicken
  4. cream chicken
  5. cream chicken
  6. fsfdsd
  7. cream chicken
  8. cream chicken
  9. cream chicken
  10. cream chicken

Reasons why answers differ:

Image captions:

  1. a can of Campbell's cream of chicken soup on a granite countertop
  2. A can of Campbell's cream of chicken soup on a kitchen counter.
  3. A picture of what appears to be some cans of soup.
  4. Two cans of soup are on top of a counter.
  5. Two tin cans of Campbell's brand soup are on the counter near a plug in.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 34: VizWiz_train_00022151.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. 2 can of Arizona iced tea sitting on a kitchen counter
  2. two 15 ounce cans of Arizona lemon iced tea
  3. Two aluminum beverage cans are sitting on a yellow counter, in front of a flat white box and a toaster oven.
  4. Two bottles of Arizona Iced tea with Lemon Flavor.
  5. Two tin cans of a popular tea rest on a counter in front of a toaster oven.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 35: VizWiz_val_00005938.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a black color small phone placed on a brown table
  2. A blue pen is next to a black phone.
  3. A cellular phone is on the audio booth table next to some electronic gear.
  4. A machine with on and off buttons and several sliding switches, with a cell phone and pen in front of it.
  5. A sound board with several controls and a pen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 36: VizWiz_val_00001963.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A remote control sitting on top of a couch.

Visual question: What is this?

Answers:

  1. clicker
  2. tv remote
  3. remote control
  4. samsung remote control
  5. samsung tv remote
  6. remote
  7. remote control
  8. remote
  9. remote
  10. samsung remote

Reasons why answers differ:

Image captions:

  1. A black remote control for a Samsung TV.
  2. a black remote control of Samsung television set
  3. A remote control for a Samsung TV including volume and channel controls, an on off switch, and play/record/stop/pause buttons
  4. black remote control with different functions for a Samsung
  5. Black Samsung TV controller on a tan ottoman.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 37: VizWiz_train_00001633.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black refrigerator sitting next to a window.

Visual question: What kind of beer is this?

Answers:

  1. can budlight
  2. bud light
  3. bud light
  4. bud light
  5. bud light
  6. bud light
  7. budlight
  8. bud light
  9. budlight
  10. bud light

Reasons why answers differ:

Image captions:

  1. A blue aluminum can of Bud Light is on the counter near a window.
  2. A can of bud light beer sits on a table
  3. A container / package that contains various goods / edible / liquid items.
  4. Blue bud light beer can sitting on top of a table.
  5. Looks like a can of some light beer.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 38: VizWiz_train_00022417.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A large envelope that is yellowish in color with blue lettering the says Pull to Remove.
  2. a pink tag or label on the edge of a table
  3. Envelope with pull to remove on the right with an arrow pointing to the right
  4. Says Pull to Remove with an Arrow pointing to the Right toward the edge of paper
  5. The end of a tan and red piece of envelope

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_val_00000224.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is the time shown on the screen?

Answers:

  1. unsuitable image
  2. unknown
  3. unsuitable image
  4. unsuitable image
  5. unsuitable image
  6. blurry lines
  7. unsuitable image
  8. cannot see time on phone
  9. unsuitable image
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A cell phone is on top of a table.
  2. A Nokia cell phone is sitting on the table.
  3. Small black phone with a blue image on the screen.
  4. The top half of a cell phone is shown laying on a table.
  5. The upper portion of a mobile phone on a wooden reflective surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 40: VizWiz_train_00012488.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is holding a banana in their hand.

Visual question: What is this?

Answers:

  1. whole kernel corn
  2. corn
  3. canned corn
  4. corn
  5. whole kernel corn
  6. watties hawkes bay whole kernel corn
  7. corn
  8. corn
  9. corn
  10. corn

Reasons why answers differ:

Image captions:

  1. A person is holding a can of Wattie's whole kernel corn.
  2. a tin food can of Wattie's brand whole kernel corn
  3. a Watties brand whole kernel can of corn
  4. I see a can of corn in a person hand
  5. Someone holding a can of yellow green Wattie's with a sink and ajax at the back of it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00010421.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person ' s hand on a bed.

Visual question: Can you identify it now, I hope? I have a very good idea what it is but, let's see.

Answers:

  1. this stack compact discs
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. cds candy
  6. no
  7. unanswerable
  8. unanswerable
  9. candy
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A black table and on the black table are two candy bars as well as a stack of CD's in clear containers as well as five cassette tapes in their plastic container.
  2. A person standing near a black table with CDs, chocolates, and cassette tape.
  3. A short of a table from above, where some CDs and audio tapes are on the right and a rollo and another type of candy bar are on the left.
  4. Stack of CD's and cassette tapes on a table next to 2 candy bars.
  5. Two wrapped containers of candy laying on a green desk next to blank CD's and tapes.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 42: VizWiz_train_00011365.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bowl of food.

Visual question: What is in this bag?

Answers:

  1. asian mixed vegetables
  2. vegetables
  3. vegetables
  4. vegetables
  5. vegetables
  6. asian vegetables
  7. vegetables
  8. asian vegetables beijing style
  9. veggies
  10. veggies

Reasons why answers differ:

Image captions:

  1. a bag of asian vegetable Beijing style soy stir fry
  2. A bag of Asian Vegetables in a Beijing Style Soy Sauce.
  3. A bag of frozen Asian stir fry vegetables lying on a granite counter.
  4. A bag of frozen vegetable stir fry is on the table.
  5. asian vegetable stir fry, frozen meal in a plastic bag, unopened

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00004400.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black and white cat laying in a dark room.

Visual question: What kind of cigarette is this?

Answers:

  1. pall mall
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. pall mall
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. newport
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A key chain with keys and other items with part of a hand in front of it.
  2. A set of keys with a store card on the ring lies next to an undetermined object that might be a cigarette.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Several keys are on a key chain on a table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 44: VizWiz_train_00023054.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bright red piece of fabric with a black and silver zipper with a clear pull tag on it
  2. A closed zipper pocket on a red textile object.
  3. A mostly red colored bag with a zipper across it.
  4. A red bag of some sort with a black zipper line.
  5. The zipper is closed on the red garment or handbag.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 45: VizWiz_val_00003667.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a keyboard on a computer.

Visual question: What does that photo say?

Answers:

  1. tnpeyt
  2. inpeyt
  3. tnpeyt
  4. inpeyt
  5. tnpeyt
  6. inpeyt
  7. tnpeyt
  8. unsuitable
  9. tnpeyt
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A black and white captcha code with a net like pattern in the background, displaying the letters, INPUT.
  2. a black and white screen captcha that reads input
  3. a captcha with TNPEYT on a black and white background
  4. A close up of text used for online security verification purposes.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 46: VizWiz_train_00005685.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: OK, I turned the can around a little bit. Can you tell me what's in the can? Thank you!

Answers:

  1. beans
  2. black beans
  3. beans
  4. beans
  5. unanswerable
  6. beans
  7. unsuitable
  8. beans
  9. beans
  10. beans

Reasons why answers differ:

Image captions:

  1. A store-bought can of beans held in the palm of a hand.
  2. Half a hand and a can of some kind of beans
  3. Inside of a kidney beans can inside of a man's hand
  4. Pictured is a can of beans in the palm of someone's hand.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 47: VizWiz_train_00015887.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a window with a light.

Visual question: What is this please tell me.

Answers:

  1. container
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A very wonderful view and worth seeing at all times, my friend
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The nutrition facts for the food are on the back of the packaging.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 48: VizWiz_val_00002499.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a couch with a blue shirt.

Visual question: What is this?

Answers:

  1. leg
  2. jeans
  3. thigh
  4. pants leg
  5. arm couch
  6. pants
  7. pillow
  8. unsuitable
  9. pant leg
  10. leg

Reasons why answers differ:

Image captions:

  1. A person's leg wearing a pair of blue and white jeans.
  2. An image of a piece of grayish white material.
  3. Quality issues are too severe to recognize visual content.
  4. The fabric is blue and appears grainy in texture on the body.
  5. the woven pattern of the cloth of a pair of pants

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 49: VizWiz_train_00023682.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is this captcha?

Answers:

  1. unsuitable image
  2. unsuitable image
  3. unsuitable image
  4. unsuitable image
  5. unsuitable image
  6. unanswerable
  7. unsuitable image
  8. unanswerable
  9. too far away to read letters

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 50: VizWiz_train_00015955.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A living room with a table and a laptop.

Visual question: I'm just testing. What is this? Thank you.

Answers:

  1. dog
  2. view room
  3. table
  4. table
  5. table
  6. table black dog
  7. sowing machine
  8. desk
  9. table
  10. glass top table

Reasons why answers differ:

Image captions:

  1. A black dog laying on the floor in front of a glass table
  2. A black dog, desk and some things shown in the image.
  3. A dog sits on the wooden floor underneath a table covered in miscellaneous items like coffee cups and books.
  4. A glass table with lots of items on it, which are hard to see, and a black dog lying on floor next to it
  5. A table with a lot of things on it and a black dog laying on the ground.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Showing images 0 - 0 out of 0 matching images.