Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_val_00003206.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A flat screen tv sitting on top of a television.

Visual question: Tell me, how big is it?

Answers:

  1. 32
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. 48 inches
  6. 22 inch tv
  7. 40
  8. 20 inches
  9. 32 inch
  10. 56 inches

Reasons why answers differ:

Image captions:

  1. A crowded TV stand showing a program that also has a cable box and sound system; a shelf is above the TV with various toys and trophies
  2. A television sitting on top of a brown shelf with other devices
  3. A TV sits on an entertainment stand as well as other equipment.
  4. Pictured is a television sitting on a wood stand in a cluttered room.
  5. Small TV hutch holding a flat screen television below multiple shelves crammed with numerous personal items.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 2: VizWiz_train_00012673.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a couch.

Visual question: what color?

Answers:

  1. olive green
  2. brown
  3. grey green
  4. brown tan
  5. grey
  6. brown white
  7. green
  8. green white
  9. khaki green cream
  10. grey

Reasons why answers differ:

Image captions:

  1. A human wear trousers and sweatshirt setting most likely in living room
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. White sweater over an army green pair of sweat pants.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 3: VizWiz_train_00022290.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle of a liquid has a blue label
  2. a bottle of wine from Chateau de Jurque
  3. A close up of a bottle of wine with a blue label.
  4. Label of a chateau de jurque bottle with their emblem
  5. The front side of a black bottle of wine with blue coloring

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 4: VizWiz_val_00002886.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall with graffiti on it.

Visual question: what city is this?

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. cd
  7. unsuitable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A CD with a white label with black text that someone wrote with a marker.
  2. A very wonderful view and worth seeing at all times, my friend
  3. A white CD has dark black text on it
  4. CD with black handwriting across the label supported by a hand
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_train_00015148.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A television that is sitting on a wall.

Visual question: Do I have any missing calls?

Answers:

  1. unanswerable
  2. no
  3. 0
  4. no
  5. no
  6. no
  7. nope
  8. yes
  9. no
  10. no

Reasons why answers differ:

Image captions:

  1. A cell phone by Samsung has a display showing the time and a photo of cardamom pods.
  2. a SAMSUNG DUOS mobile phone with FM Radio
  3. An electrical device Samsung duos mobile phone operating.
  4. The front screen of a cellular phone with the time displayed (23:18).
  5. Timing is seen on a mobile phone hold by someone in hand

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00010894.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cloudy sky with a white background.

Visual question: What kind of wine is this?

Answers:

  1. unanswerable
  2. unsuitable
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unanswerable
  8. white
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 7: VizWiz_train_00015868.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a frisbee.

Visual question: What is this a case for?

Answers:

  1. unsuitable
  2. cellphone
  3. unanswerable
  4. unsuitable
  5. unanswerable
  6. clothes
  7. unsuitable
  8. beer
  9. unsuitable
  10. cell phone

Reasons why answers differ:

Image captions:

  1. A blanket has a cartoon music instrument on it
  2. A cartoon drawing or book cover with a record player on it
  3. Blurry, cartoon of a gramophone, possibly printed on fabric.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 8: VizWiz_train_00019084.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a green and white bed.

Visual question: WHAT COLOR IS THIS?

Answers:

  1. brown white green
  2. white green brown
  3. green brown white
  4. green white tan
  5. white green yellow brown stripes
  6. white green tan brown stripes
  7. white green brown tan striped
  8. brown yellow white green
  9. white cloth green brown stripes
  10. green white brown tan stripes

Reasons why answers differ:

Image captions:

  1. A part of someone's bed sheets with white, green, and tan stripes on them.
  2. A tan, green and white striped piece of fabric
  3. Piece of white fabric with green, brown, and beige stripes going across it.
  4. Striped sheets are shown on top of a bed.
  5. White kitchen town with brown, green and beige stripes.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 9: VizWiz_train_00002091.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bench on a wall.

Visual question: What flavor of ice cream is this?

Answers:

  1. chocolate
  2. unanswerable
  3. chocolate
  4. unanswerable
  5. chocolate
  6. chocolate
  7. chocolate
  8. chocolate
  9. chocolate
  10. chocolate

Reasons why answers differ:

Image captions:

  1. A box of chocolate ice cream sitting on a white tile surface
  2. A box of thrifty ice cream has chocolate ice cream on it.
  3. a picture of an ice cream can with a chocolate ball on the outside.
  4. Package for ice cream that shows image of the product on the cover.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_train_00007376.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that has a sticker on it.

Visual question: What flavor is this?

Answers:

  1. peach mango
  2. peach mango
  3. peach mango
  4. peach mango
  5. peach mango
  6. peach mango
  7. peach mango
  8. peach mango
  9. peach mango
  10. peach mango

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle of hand soap is laying on top of a green cloth.
  2. A bottle of peach mango scented hand soap
  3. a package of peach mango hand soap clear on a blue surface
  4. A plastic bottle of hand washing soap with a large red label.
  5. Bottle of hand soap containing peach mango flavor

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_val_00005565.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A can of food sits on top of a dark leather chair.
  2. Can of Hanover brand vegetables on a a chair
  3. Looks like a photo of a can on a black car seat.
  4. Quality issues are too severe to recognize visual content.
  5. The can is upside down and has a green label on the front.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 12: VizWiz_val_00002939.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A frisbee sitting next to a bottle of water.

Visual question: All right they are closed now. I don't think there are any labels on there though, but if there are any labels on there. I don't know, I think it's the right one too.

Answers:

  1. unanswerable
  2. label on left 1
  3. no
  4. so matte perfect stay
  5. unanswerable
  6. lids
  7. unanswerable
  8. so mate perfect stay
  9. so matte perfect stay on left right blank
  10. miss sport so matte perfect stay

Reasons why answers differ:

Image captions:

  1. a blue and a yellow color circular object are beside each other
  2. a blue tun of perfect stay makeup and some other beauty products sitting on a dark surface
  3. Matte pressed powder in a blue flat case next to an orange container
  4. Multiple metal containers sitting on a Darkly stained wooden surface
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 13: VizWiz_train_00020919.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A pile of metal machinery parts and electronics on a wooden table.
  2. A silver machine is deconstructed into many different parts.
  3. An object is dismantled on top of a table.
  4. Parts of some instrument are on the floor.
  5. some sort of machine, or pieces of a machine, small pieces fill a tray, film, computer chips, etc, a light bulb can also be seen, not sure what kind of machine or mechanism this is.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 14: VizWiz_train_00017013.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A green night shot of a dark ball.

Visual question: What is this?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. black
  5. nothing
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A black image with the absence of any identifiable features.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 15: VizWiz_train_00010984.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: What is in this can, please?

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Side view of cans label with nutritional facts and heating instructions on it.
  3. some container in white color is seen and labelled with blue color wordings
  4. The back of the bottle of some sort of food.
  5. The nutrition and heating instructions on a can of food

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 16: VizWiz_train_00001329.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a sign that is on the wall.

Visual question: What kind of sauce is this?

Answers:

  1. ranch
  2. asedwrfaf
  3. heartland ranch
  4. wendys heartland ranch
  5. rance
  6. ranch
  7. wendy┬╗s heartland ranch
  8. wendys heartland ranch
  9. dipping sauce
  10. ranch

Reasons why answers differ:

Image captions:

  1. A container of Wendy's Heartland Ranch on a tiled table
  2. a cup of Wendy's heartland ranch dipping sauce
  3. A cup of Wendy's ranch is on the floor
  4. A single serving container of Wendy's Heartland Ranch Dipping Sauce.
  5. SMALL CONTAINER OF RANCH PLACED ON A FLOOR

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_train_00017888.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a clock sitting on a table.

Visual question: What is this?

Answers:

  1. can
  2. pop top can
  3. can
  4. soup can
  5. unsuitable
  6. can
  7. pull tab aluminum can
  8. can
  9. can
  10. this top can

Reasons why answers differ:

Image captions:

  1. A closed cans lid with a pop top type device which has been factory sealed.
  2. showing the pop top of a unopened can of food
  3. The can has a silver top and a pull tab to open it.
  4. The top of a can with a pull tab bearing instructions on opening the can.
  5. Top of a lift tab top steel can that has caution on it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 18: VizWiz_val_00006560.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a cd of some sort that u can listen to stuff on
  2. An orange box is on top of the table.
  3. On a counter is a tea pouch with an orange and white front.
  4. Orange unopened tea bag packet with instructions on it in black and orange writing.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 19: VizWiz_train_00019431.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: Is this lemon juice or lime juice, thank you.

Answers:

  1. erd
  2. lime
  3. lime juice
  4. lime juice
  5. lime juice
  6. lime
  7. lime
  8. lime
  9. lime juice
  10. lime juice

Reasons why answers differ:

Image captions:

  1. A bottle of lime concentrate against a black background.
  2. a bottle of lime juice with the bottle being green
  3. a bottle of natural strength lime juice from concentrate
  4. A green bottle of lime juice with a branded label is on the ground.
  5. Bottle of Lime Juice from Concentrate, cannot read what brand.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 20: VizWiz_train_00002415.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is holding a red blanket on the wall.

Visual question: What color is this shirt?

Answers:

  1. red
  2. red
  3. red
  4. red
  5. red
  6. red
  7. red
  8. red
  9. red
  10. red

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bright red piece of cloth being held by someone.
  2. A hand holding a clean bright red T-shirt with no decoration on it.
  3. A hand is holding a bright red sweatshirt
  4. A hand that is holding a red shirt stretched out.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 21: VizWiz_train_00000300.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A statue of a toy of a glass.

Visual question: What is this?

Answers:

  1. decoration
  2. drinking bird
  3. dunking bird
  4. decoration
  5. bobbing toy
  6. toy
  7. toy bird glass water
  8. drinking bird toy
  9. drinking duck glass water
  10. toy

Reasons why answers differ:

Image captions:

  1. A glass of water next to a bird toy with a blue hat and green tail feather
  2. A red and white "drinking bird'" toy on a windowsill.
  3. A toy that has a rooster head witch can bob back and forth
  4. Before a window sits a glass of water and a bird toy.
  5. Pictured is a bobber bird over a glass of water on a counter top.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 22: VizWiz_train_00017091.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A table with a book on top of it.

Visual question: What is this?

Answers:

  1. table
  2. desk top
  3. table
  4. table
  5. unanswerable
  6. table
  7. floor
  8. table
  9. table
  10. table

Reasons why answers differ:

Image captions:

  1. A wooden board has many paper cards on it
  2. A wooden desk is shown with various items on it
  3. A wooden table with glasses on top of a pile of cards.
  4. I can see reading glasses on a table with possibly bank checks and stubs.
  5. In this picture is a image of a wood floor

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 23: VizWiz_train_00022464.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a small excerpt of text that is on a red background
  2. Dark burgundy product package with information on the Twinings Story.
  3. Quality issues are too severe to recognize visual content.
  4. the lower end of a package of Twinings tea where the story is located
  5. The red label on a box of Twinings brand tea

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 24: VizWiz_train_00020865.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue tint surrounds the picture taken at a fast speed.
  2. Only a beam of light can be seen on a blue/gray background.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 25: VizWiz_train_00018578.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a microwave on a table.

Visual question: Where can I buy it?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. wee
  5. office supply store
  6. walmart
  7. store
  8. walmart
  9. unanswerable
  10. in shop online

Reasons why answers differ:

Image captions:

  1. A flash data memory stick USB is on a white table.
  2. A Jet Fresh brand 4 GB USB thumb drive.
  3. A small white flash drive with green plastic in the middle displaying the words JetFlash and 4GB
  4. a white 4GB JetFlash placed on the white surface
  5. A white Jet Flash 4 GB drive sitting on a surface.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 26: VizWiz_train_00016145.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black and white oven.

Visual question: This is the display of a treadmill, could you tell me the distance in miles please?

Answers:

  1. 3.29
  2. 3.29
  3. dont know
  4. 3.29
  5. 329
  6. 3.29
  7. 3.29
  8. 3.29
  9. 3.29
  10. 3.29

Reasons why answers differ:

Image captions:

  1. a digital machine that reads your speed or rate you while exercising
  2. An led screen and buttons on a treadmill.
  3. Digital display for a treadmill with different shaped buttons for different settings.
  4. The display and control pad of an exercise machine.
  5. The LCD display of an exercise machine is shown

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 27: VizWiz_val_00007707.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A screen of some sort with three men on it and another person behind them, a possible joystick in front of it.
  2. A screen showing a man in a police uniform talking to two men in blue shirts and dark pants.
  3. A television screen with a police officer and 2 men in blue shirts depicted on the screen.
  4. Television screen with four men on the screen.
  5. Two men on a screen talking to a police officer.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 28: VizWiz_train_00011650.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of water.

Visual question: What is this beverage?

Answers:

  1. aerfa
  2. diet coke
  3. diet coke
  4. diet coke
  5. diet coke
  6. diet coke
  7. diet coke
  8. diet coke
  9. diet coke
  10. diet coke

Reasons why answers differ:

Image captions:

  1. An empty 20 oz plastic bottle of Diet coke
  2. An empty bottle of Diet coke sits on a wooden table.
  3. An empty Coca-Cola small diet bottle on a wood surface.
  4. Empty bottle of diet coke on a wood desk.
  5. Someone has emptied a bottle of diet coke on the table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 29: VizWiz_train_00012089.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a cup of food on a table.

Visual question: Can you please tell me what kind of tea or coffee this is?

Answers:

  1. celestial
  2. sweet peach black tea
  3. sweet peach
  4. sweet peach
  5. celestial seasonings sweet peach black tea
  6. iced tea
  7. peach tea
  8. sweet peach tea
  9. celestial peach iced tea
  10. iced tea

Reasons why answers differ:

Image captions:

  1. A orange box containing items to make Iced tea.
  2. A rear end of a box of peach iced tea is sitting on a surface, the tea is described in text on the back of the box.
  3. Box of Celestial Seasonings Sweet Peach Black Tea for iced tea.
  4. Orange and gray cardboard box on a desk
  5. Sweet peach flavored black tea for iced tea.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 30: VizWiz_train_00007252.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a sign.

Visual question: What is on this t shirt? I know there's a knight. But what's the words?

Answers:

  1. knights newcastle
  2. newcastle
  3. knights newcastle
  4. knight
  5. knights newcastle
  6. newcastle
  7. knights newcastle alliance
  8. knights newcastle community alliance
  9. knights newcastle community alliance
  10. knights newcastle community alliance

Reasons why answers differ:

Image captions:

  1. A white garment or other piece of cloth has an illustrated logo of a knight's helmet and says "Knights Newcastle".
  2. Part of a T-shirt with a Knights Newcastle logo can be seen, and there is a hand to the left of the T-shirt.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The fabric has the words Knights Newcastle on it and community alliance.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 31: VizWiz_train_00009735.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A wall with a picture of a cat on it.

Visual question: What's the number?

Answers:

  1. 10
  2. 10
  3. 10
  4. 10
  5. 10
  6. 10
  7. 10 clubs
  8. 10
  9. 10
  10. card

Reasons why answers differ:

Image captions:

  1. A 10 of Clubs card from a standard deck of cards.
  2. A playing card featuring the number 10 and 10 Black club icons
  3. One ten of clubs card on a white surface.
  4. The 10 of clubs playing car laying on a shiny surface.
  5. The top half of a 10 of clubs card

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 32: VizWiz_train_00006211.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a purple blanket on it.

Visual question: What color is this cloth?

Answers:

  1. pink
  2. pink
  3. pink
  4. pink
  5. pink
  6. pink
  7. pink
  8. purple
  9. pink
  10. pink

Reasons why answers differ:

Image captions:

  1. A close up of a pink fabric with focus on the stitching of a pocket.
  2. A hot pink piece of fabric with a pocket.
  3. A piece of pink fabric with what seems like a pocket
  4. I see a brightly colored pink piece of clothing with a pocket sewn on it.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 33: VizWiz_train_00020166.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black table with glass case holds a television set and a wireless house thermostat on top.
  2. A black television with a grey DVD player on top.
  3. A photo of an entertainment center that is reflecting a person's foot.
  4. An old black box TV sits on the ground unplugged with some other electronic devices on top of it
  5. the corner of a black media stand, sitting on carpet

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 34: VizWiz_train_00006617.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dark up of a black sky at night.

Visual question: What color is this?

Answers:

  1. black
  2. black
  3. unsuitable
  4. black
  5. black
  6. black
  7. black
  8. black
  9. unsuitable
  10. black

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_train_00002247.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dog that is standing on the floor.

Visual question: What color is this dog?

Answers:

  1. brown
  2. tan
  3. brown
  4. brown
  5. golden brown
  6. unsuitable
  7. reddish brown
  8. light brown
  9. brown
  10. brown color dog

Reasons why answers differ:

Image captions:

  1. -"There is/are dog "
    -"This is /These are dog "
    -"The/This image/picture dog"
    -"It is/ It's dog "
  2. a tile floor with a bit of a dog in the corner
  3. An animal is asleep on a tile floor.
  4. Partial ear and head of a dog laying on a tiled floor
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 36: VizWiz_train_00016922.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a knife.

Visual question: What is this?

Answers:

  1. cherry cough drops
  2. toaster pastries
  3. unsuitable
  4. dafgv
  5. cherry toaster pastries
  6. unsuitable
  7. unsuitable
  8. toasted pastries
  9. cherry toaster pastries
  10. frosted cherry toaster pastries

Reasons why answers differ:

Image captions:

  1. 16 pack of Cherry flavor toaster pastries from Walmart
  2. A box of frosted cherry toaster pastries with cherries.
  3. A white and red box of cherry breakfast toaster pastries.
  4. box of frosted cherry toaster pastries it contains 16
  5. The front side of a cherry box with white coloring

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 37: VizWiz_train_00000529.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person wearing a tie.

Visual question: What color are these pants?

Answers:

  1. pink
  2. pink
  3. pink
  4. pink
  5. pink
  6. pink
  7. pink
  8. pink
  9. pink
  10. pink

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A pink piece of fabric folded over itself.
  2. A pink sheet of fabric sits folded on someone's lap.
  3. A wrinkled pair of pink or khaki pants.
  4. an image of cloth, possibly a pink jacket
  5. Close up of pink knitted clothing item, possibly a skirt.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 38: VizWiz_val_00001828.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall with a wooden background.

Visual question: Can you give me any information on this radio? Availability, repair, other stuff? Thanks.

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. brown
  5. unsuitable
  6. no
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a black alarm clock or radio with the antenna up
  2. A pair of white cabinet doors in a room with white walls.
  3. A radio with an antenna extended from the top.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 39: VizWiz_val_00005708.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle of body lotion that is on the table.
  2. A half full bottle of white musk body lotion sits on a desk.
  3. a tall white bottle with lotion inside of it
  4. A very wonderful view and worth seeing at all times, my friend
  5. A white bottle of body lotion on a wooden surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_train_00008266.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of an apple on a table.

Visual question: Does this onion look ok it's already been cut.

Answers:

  1. yes
  2. no
  3. yes
  4. yes
  5. yes
  6. yes
  7. yes
  8. yes
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. a picture of a melting onions on a white table disgusting
  2. A sliced purple onion on a white, flat surface.
  3. A white surface with a half cut red onion
  4. Half of a purple onion placed on a white counter.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 41: VizWiz_train_00001749.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of food.

Visual question: What type of pills are these?

Answers:

  1. kirkland signature natural omega 3 fish oil 1000 mg
  2. omega 3 fish oil
  3. fish oil
  4. fish oil
  5. omega 3 fish oil 1000 mg
  6. omega 3 fish oil 1000mg
  7. fish oil
  8. fish oil
  9. fish oil
  10. fish oil 1000mg

Reasons why answers differ:

Image captions:

  1. A bottle of a supplement held by a person.
  2. a bottle of Kirkland brand omega 3 fish oil 1000mg
  3. A large white bottle of supplements stating "Kirkland Signature, Natural Omega 3 Fish Oil, 1000 mg" the majority of the bottle is yellow and takes up the whole picture.
  4. Bottle of Kirkland natural omega 3 fish oil 1000 mg capsules.
  5. In this picture is a image of omega fish oil

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_val_00003357.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is sitting on a table.

Visual question: What is this?

Answers:

  1. internet security
  2. internet security software
  3. 2015 internet security book
  4. internet security 2011
  5. security system
  6. internet security program
  7. anti virus
  8. software
  9. opuell
  10. internet security

Reasons why answers differ:

Image captions:

  1. A sign with a 2011 version of a pc security program in it.
  2. A white label on a foreign language box of an internet security software product.
  3. An advertisement in a magazine or other paper for an Internet security software.
  4. Bookstore advertisement for 2011 Internet Security by PC Tools.
  5. Packaging for internet security software by the brand PC Tools.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00009938.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of food on a table.

Visual question: What is this?

Answers:

  1. wine
  2. saint henri chaleauneuf du pape
  3. bottle wine saint henri
  4. wine
  5. wine
  6. chaleauneuf du pape
  7. wine
  8. wine
  9. saint henri chaleauneuf du pape 2005
  10. saint henri chaleaumeuf du pape

Reasons why answers differ:

Image captions:

  1. A bottle of what looks like an alcoholic beverage, year 2005.
  2. a container/ box / bottle that contains liquid / goods.
  3. Dark colored glass bottle of Saint-Henri Chateauneuf-du-Pape with the year date of 2005 and a white label with black and white lettering and picture of castle.
  4. image shows a champagne bottle called saint Henri on a table.
  5. On top of a box sits a bottle of red Saint-Henri wine from the year 2005.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 44: VizWiz_train_00006524.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a piece of paper.

Visual question: Is this starch for ironing clothes?

Answers:

  1. unsuitable
  2. unsuitable
  3. no
  4. unsuitable
  5. unsuitable
  6. yes
  7. no
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a hand holding something brown, yellow and white with a table in background
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. someone holding a dark brown item with white on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 45: VizWiz_train_00008358.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A red picture of a bright wall.

Visual question: What color is my shirt?

Answers:

  1. orange
  2. red
  3. red
  4. orange
  5. red
  6. red
  7. red
  8. red
  9. red
  10. red

Reasons why answers differ:

Image captions:

  1. a bright red piece of fabric with texture
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 46: VizWiz_train_00001635.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A box of pizza sitting on top of a table.

Visual question: What is this?

Answers:

  1. frozen meal
  2. frozen dinner
  3. smart simple prepared meal
  4. smart simple seafood microwaveable dinner
  5. unsuitable
  6. tv dinner
  7. unsuitable
  8. recipe instructions
  9. smart simple meal
  10. microwave dinner

Reasons why answers differ:

Image captions:

  1. A frozen meal of shrimp called smart and simple.
  2. A Smart & Simple frozen dinner of Shrimp and peppers in orange packaging.
  3. An oval container with a frozen Fresh Direct smart and simple shrimp dinner sitting on a counter top.
  4. package of FreshDirect branded Smart and Simple food with shrimp and vegetables on the label
  5. Smart and simple Fresh Direct microwavable shrimp meal in a box.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_train_00019454.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bunch of scissors sitting on top of a table.

Visual question: In the following diagrams draw the shape of the water in the Earth's oceans tidal budges. Label each diagram with a moon face identify if you are experiencing high tide or low tide.

Answers:

  1. high
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. no

Reasons why answers differ:

Image captions:

  1. A piece of paper with images of the moon and it's phases.
  2. A worksheet indicating moon phases and tides with an illustration of the moon relative to the Earth and a solar photograph.
  3. An excerpt from a page of a workbook.
  4. stapled sheets of paper show a science homework assignment, about moon phases and tides.
  5. Worksheet about Moon phases and tides with photos and 4 different settings

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00008481.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a brown wall on the ground.

Visual question: What color socks are those?

Answers:

  1. white
  2. unsuitable
  3. beige
  4. off white
  5. tan
  6. white
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a wrinkled cover laying on a bed in a room.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 49: VizWiz_val_00004163.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a clock on a wall.

Visual question: what is this thermostat temperature set at

Answers:

  1. unsuitable
  2. i dont know
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. 78
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A small white object with buttons and a display window is displayed to one side of the picture.
  2. A white thermostat is placed on an orange toned wall.
  3. A white wall mounted thermostat with a black pasted note under the unit.
  4. Quality issues are too severe to recognize visual content.
  5. Thermostat on a tan wall in a house with no text displayed on the screen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 50: VizWiz_train_00017408.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of beer.

Visual question: What soap is this?

Answers:

  1. borsch
  2. unanswerable
  3. borsch
  4. no soap
  5. unanswerable
  6. borsch
  7. borsch
  8. borsch
  9. borsch
  10. borsch soup

Reasons why answers differ:

Image captions:

  1. A Borsch soup can being held in hand.
  2. A can of Campbell's brand Borsch variety soup.
  3. A photo of red and white label Campbell's borsch soup.
  4. A silver can of soup in a red and white label in a person's hand.
  5. appears to be a picture of a can of food

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.