Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_val_00002699.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A group of dogs standing on the floor.

Visual question: What color is the carpet

Answers:

  1. beige
  2. no carpet
  3. tan
  4. brown
  5. light grey
  6. unsuitable
  7. grey
  8. grey
  9. taupe
  10. brown

Reasons why answers differ:

Image captions:

  1. a brown carpet with an entertainment center in the corner of the room and some dog statues against the wall
  2. A room showing carpet layout, with nik naks and furniture in the corner of room
  3. A room with a tan carpet, an entertainment center, and some dogs in the corner
  4. Corner of a living room with entertainment center and 3 dogs
  5. Three dogs, two sitting and one on his back, in a living room next to a TV stand.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 2: VizWiz_train_00018517.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a stop sign that says \

Visual question: What book is this? Thank you.

Answers:

  1. vengeance
  2. vengeance
  3. vengeance
  4. vengeance
  5. vengeance
  6. vengeance
  7. vengeance
  8. vengeance
  9. vengeance
  10. vengeance

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A book with a red cover and a silhouette of a gun.
  2. A poster is on top of the wall.
  3. A red book with white text has a picture of a gun
  4. Quality issues are too severe to recognize visual content.
  5. The top side of a vengeance gun book

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 3: VizWiz_train_00005255.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a green blanket.

Visual question: What color is it?

Answers:

  1. blue
  2. white green trim
  3. blue
  4. white
  5. green
  6. light green
  7. seafoam
  8. teal
  9. green
  10. green

Reasons why answers differ:

Image captions:

  1. A teal colored fabric has dark green edges
  2. A thick white piece of fabric with green piping on the edge.
  3. A white cloth with a green border is folded loosely.
  4. A white piece of clothing has a green edge sewn into it.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 4: VizWiz_train_00007475.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a yellow surfboard on a table.

Visual question: What is this item?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. box
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. olives
  10. food

Reasons why answers differ:

Image captions:

  1. A bottle with a yellow label resting on a counter top.
  2. A can of tin goods with a bright yellow label.
  3. Can or canister with yellow label resting on counter top
  4. The bag is yellow and white in color and is packed so it can stand up.
  5. yellow can or bottle on a green countertop.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 5: VizWiz_val_00000547.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What kind of phone is this?

Answers:

  1. blackberry
  2. blackberry
  3. blackberry
  4. blackberry
  5. blackberry
  6. blackberry curve 3g
  7. blackberry
  8. blackberry
  9. blackberry
  10. blackberry

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A BlackBerry phone with screen on and an alert for three messages.
  2. A box containing a blackberry curve 3g cell phone.
  3. A portion of the packaging for a BlackBerry cellular phone.
  4. An advertisement for the BlackBerry Curve 3G phone.
  5. The paper casing to a BlackBerry Curve phone displays a close-up image of the phone; the phone has a keyboard and various buttons for calls and other functions.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 6: VizWiz_train_00004823.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person ' s hand in a kitchen.

Visual question: What is this?

Answers:

  1. dots candy
  2. dots candy
  3. dots candy
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. candy box
  8. dots
  9. candy
  10. door

Reasons why answers differ:

Image captions:

  1. a box of dots candy showing the nutrition label
  2. A small portion of a person's hand can be seen holding a box of candy Dots with a front door on a green wall with a clock in the background.
  3. a yellow box of dots candy, a white door and a plaid sofa
  4. Someone's house door is shown with a box of candy in the corner of the camera.
  5. The box is yellow in color and has the nutrition facts displayed.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_train_00000473.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a picture of a man in the window.

Visual question: What is it?

Answers:

  1. dollar
  2. money
  3. 1 dollar bill
  4. 1 dollor
  5. $1 bill
  6. 1 dollar bill
  7. 1 dollar bill
  8. dollar bill
  9. 1 dollar bill
  10. dollar bill

Reasons why answers differ:

Image captions:

  1. A dollar billed faced up sitting on a white surface table.
  2. A note of money is placed on a white surface.
  3. A single dollar bill lying face-up on a white table.
  4. A US one dollar bill sitting against a white surface.
  5. Chose a $1 bill sitting on a white counter

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 8: VizWiz_train_00013926.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A table with a candle on top of it.

Visual question: What is this

Answers:

  1. unsuitable
  2. lamp
  3. unanswerable
  4. table lamp
  5. small glass jar
  6. desk
  7. desk lamp objects
  8. desk lamp
  9. desk
  10. unanswered

Reasons why answers differ:

Image captions:

  1. a light on a desk with a few canisters and shadows on the wall
  2. A nightstand containing two jars, some books and a table lamp.
  3. A silver lamp on a table casting shadows on a wall.
  4. Jars on top of a side table with a desk lamp.
  5. Small table with a can, a jar, a reading lamp and binder.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 9: VizWiz_train_00001319.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dog that is standing in a room.

Visual question: This is a test, this is a test, this is a test, this is a test.

Answers:

  1. unanswerable
  2. blur
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. ok test
  10. yes

Reasons why answers differ:

Image captions:

  1. An black iron rod stand with three brown bins that have items in the bins.
  2. Beautiful view from behind the walls hidden under dark mist
  3. IMAGE WAS CLEAR BUTS ITS ITEAM SO ITS DARK
  4. Set of black shelves in a carpeted living room
  5. Wire and wicker basket organizer on carpeted floor.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 10: VizWiz_train_00002088.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a man on a wooden surface.

Visual question: What color is this shirt?

Answers:

  1. white
  2. grey
  3. purple
  4. beige
  5. grey
  6. white
  7. white
  8. grey
  9. white
  10. white

Reasons why answers differ:

Image captions:

  1. a large white and grey cloth with some wrinkles
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 11: VizWiz_train_00012564.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black and white toothbrush.

Visual question: What is this?

Answers:

  1. fan
  2. fan
  3. fan
  4. fan
  5. heater
  6. air purifier
  7. machine
  8. fan
  9. dont know
  10. fan

Reasons why answers differ:

Image captions:

  1. A air conditioning machine that is placed on a window sill.
  2. a light grey standing fan in a windowsill
  3. A small portable air conditioning fan is seen in a white hallway entrance.
  4. A tower fan is sitting on a window seat, beside brown patterned curtains.
  5. Tall black oscillating fan with cord wrapped around stand

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 12: VizWiz_train_00015032.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A blue sign sitting on top of a table.

Visual question: What flavor noodles are these?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unsuitable
  7. parmesan
  8. unanswerable
  9. parmesan
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A big blue box of Pasta Roni next to a jar of kitchen utensils.
  2. A blue box of ready to cook Pasta Roni is in the kitchen.
  3. A single box of Pasta Roni that is unopened on the kitchen counter.
  4. Box of Pasta Roni with kitchen household items in the back
  5. Packaging for a box of Pasta Roni brand pasta.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 13: VizWiz_train_00007545.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a baseball game.

Visual question: What is this box?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A close up of a box of something that says Easy on it
  2. a close up of a package of microwavable food
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00014197.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person sitting on top of a bed.

Visual question: Tell me what the scale says please. Thank you.

Answers:

  1. 105.2
  2. 105.2
  3. 105.2
  4. 105.2
  5. 105.2
  6. 105.2
  7. 105.2
  8. unsuitable
  9. 105.2
  10. 105.2

Reasons why answers differ:

Image captions:

  1. A downward shot of shoeless feet on bathroom scale weighing about 105 pounds and floor is white square tiles.
  2. a person standing in the machine to know weight
  3. a person standing on a scale that reads 105
  4. a person standing on a scale, whose weight reads 1052
  5. A silver electronic device has several numerical digits.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 15: VizWiz_train_00006507.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a microwave on a counter.

Visual question: What is the countdown time on this timer?

Answers:

  1. 3 hours 6 minutes 27 seconds
  2. 3:06
  3. 3 minutes 6 seconds
  4. 3:06 27
  5. 3:06
  6. 3:06
  7. 3:06.2
  8. 3:06
  9. 3:06
  10. 3:06

Reasons why answers differ:

Image captions:

  1. A radio with a digital screen and numbers on top of it
  2. A screen showing a time, surrounded by a silver casing and buttons.
  3. A talking electronic item with blue buttons and a display screen
  4. an electronic machine with buttons and a digital numerical display
  5. Pictured is a clock or timer designed to keep start and end time.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 16: VizWiz_train_00011326.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a bottle of pizza on a table.

Visual question: What is this?

Answers:

  1. beef chuck steak
  2. beef chuck steak
  3. beef chuck steak
  4. beef chuck steak
  5. flat iron beef chuck steak
  6. beef chuck steak
  7. steak
  8. flat iron beef chuck steak
  9. beef chuck steak
  10. beef chuck steak

Reasons why answers differ:

Image captions:

  1. A medium sized package of seasoned chuck steak premade.
  2. A package of beef chuck steak, showing cooking and nutrition information.
  3. a package of Flat Iron beef chuck steak
  4. Quality issues are too severe to recognize visual content.
  5. the nutritional facts on the back of a bag

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_val_00000262.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Can you tell me the 1-800 number off this card, and are you able to hear this question?

Answers:

  1. 1 800 735 2929 cant hear q
  2. 1 877 328 9677
  3. 1 877 328 9677 1 800 735 2929
  4. 18773289677
  5. 1 877 328 9677
  6. 1 800 735 2929
  7. 1 877 328 9677 no
  8. 18773289677 no
  9. 1 877 328 9677
  10. 8007352929 no

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A yellow EBT card on a dark fabric surface.
  2. back of yellow Quest card with black text on it and a white empty signature box
  3. The back of a California EBT debit card.
  4. The back of an EBT card that is placed on a black surface.
  5. The backside of a beige EBT card with a magnetic strip.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 18: VizWiz_train_00005692.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dark picture of a black wall.

Visual question: What color is this?

Answers:

  1. black
  2. black
  3. black
  4. unsuitable
  5. black
  6. black
  7. unsuitable
  8. black
  9. black
  10. black

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 19: VizWiz_train_00005352.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A stack of books with a book on it.

Visual question: Is this benadryl?

Answers:

  1. yes
  2. unanswerable
  3. yes
  4. yes
  5. yes
  6. yes
  7. unanswerable
  8. yes
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A white box of an allergy medicine and the box is opened.
  2. A white box with red outlines and blue text.
  3. The back of a package that contains ingredients and warnings about the product
  4. the back of an open box of benadryl tablets
  5. The back side of a box of medication in a person's hand.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 20: VizWiz_train_00022109.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A medium size or half a jug of milk.
  2. A person holding a half gallon jug of whole milk.
  3. A person holds a quart jug of dairy milk in their hand.
  4. A person is holding a white bottle in his left hand
  5. a person sitting and holding a bottle of milk with a red cap

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 21: VizWiz_train_00018489.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer that is on a table.

Visual question: What is this product?

Answers:

  1. red chile sauce
  2. chile sauce
  3. can medium red chile sauce
  4. red chile sauce
  5. chile sauce
  6. red chile sauce medium
  7. chile sauce
  8. red chile sauce
  9. chile sauce
  10. red chili sauce

Reasons why answers differ:

Image captions:

  1. A can of chili sauce is resting on the fabric.
  2. A can of Las Palmas medium red chile sauce is laying on a beige towel.
  3. A can of Las Palmas Red Chile Sauce is sitting on a towel.
  4. The side of a can with a yellow label stating "Chile Sauce" and having a picture of a bowl of red sauce.
  5. Yellow cab of Las Palmas red chile sauce

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_train_00004805.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a man on a wall.

Visual question: What's in this picture?

Answers:

  1. bikes buses
  2. motorcycle bike bus another vehicle
  3. motorcycle bicycle bus car
  4. motorcycle bus bicycle cart
  5. homework
  6. transportation images
  7. different forms transportation
  8. motorcycle bike bus car
  9. transportation methods
  10. motorcycle bus bicycle car

Reasons why answers differ:

Image captions:

  1. a kids coloring page with different kinds of transportation on it
  2. A piece of paper has objects for coloring in
  3. A white paper has pictures of a bike and bus on it.
  4. A worksheet is covered with several modes of transportation.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00001953.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person ' s hand.

Visual question: Can you see what's in this can?

Answers:

  1. unsuitable
  2. no
  3. no
  4. unsuitable
  5. unsuitable
  6. no
  7. no
  8. no
  9. no
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A finger has been shown in the picture.
  2. a persons thumb pushing into a tab on a metal container
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Top of a circular gold can with a red label.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 24: VizWiz_train_00015129.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bed on a blanket.

Visual question: What color is this polo shirt?

Answers:

  1. yellow
  2. golden
  3. yellow
  4. yellow darker yellow leaves
  5. yellow tan
  6. yellow
  7. tan leafy design
  8. gold
  9. brown
  10. goldish yellow patterned

Reasons why answers differ:

Image captions:

  1. a t shirt with a unique pattern on it to wear
  2. a tan and cream floral design polo shirt
  3. A tan and gold blouse with a pattern of leaves is laid out.
  4. A yellow shirt with yellow leaves printed on it.
  5. Picture of a yellow flower shirt button up.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 25: VizWiz_train_00011904.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A glass of red wine on a table.

Visual question: Red or white?

Answers:

  1. red
  2. red
  3. red
  4. red
  5. red
  6. red
  7. red
  8. red
  9. red
  10. red

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up picture of a mostly empty glass of red wine.
  2. A glass with wine in it is sitting on a table.
  3. a half empty glass of red wine on a crowded table
  4. a nice wine glass with some red wine in it.
  5. A wine glass with a bit of red wine in it is sitting on a dinner table.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 26: VizWiz_val_00002214.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a bottle of food.

Visual question: I'd like to see if you can identify this tube. Some sort of cream.

Answers:

  1. unsuitable
  2. no
  3. hydrocortisone
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unsuitable
  10. poison oak poison ivy

Reasons why answers differ:

Image captions:

  1. A hand holding a tube of skin ointment
  2. A person holding up a tube of cream for cuts.
  3. A small tube of some medical ointment in a person's hand.
  4. A white tube of cream or ointment being held by a hand
  5. The back side of a tube of toothpaste, which is held by a person's hand.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_train_00004954.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a birthday cake with a dog.

Visual question: How many calories in one serving?

Answers:

  1. 50
  2. 150
  3. unanswerable
  4. 150
  5. unanswerable
  6. unanswerable
  7. 100
  8. 20
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A bag of orange-colored, Cheddar Jalapeno Cheetos lies on a marble surface.
  2. An orange and green plastic bag of Cheetos with a cartoon picture of Chester Cheetah on it.
  3. Appears to be a picture of a chip bag
  4. Bag of Cheetos Cheddar Jalapeno flavor Crunchy Cheese doodles.
  5. in this picture is a image of bag of chips

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_val_00005788.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A wooden table has dark streaks on it.
  2. In this picture is a wall with red stripes panels
  3. Quality issues are too severe to recognize visual content.
  4. The grain of a light colored piece of pine wood is the focus of the picture.
  5. The striping of a wood surface is shown.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00019451.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on a table.

Visual question: Whats in the box?

Answers:

  1. grooming products
  2. lotion
  3. body wash
  4. shower duo
  5. unsuitable
  6. mens beauty products
  7. men hygiene products
  8. unanswerable
  9. unanswerable
  10. gchomme

Reasons why answers differ:

Image captions:

  1. A blister pack of the GC Homme Shower Duo product containing gel and a washcloth.
  2. a package of GC home shower wash and gel
  3. A package of shower products are on a table.
  4. A shower set containing shower gel on the left and an additional cleaner on the right hand side.
  5. Here is a gc homme brand duo package of unopened blue and black colored tubes of shower cleansers.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 30: VizWiz_train_00006835.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a window of water.

Visual question: what does the screen say?

Answers:

  1. unsuitable
  2. unanswerable
  3. nothing
  4. unsuitable
  5. unanswerable
  6. unsuitable
  7. unanswerable
  8. nothing
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A flash on the screen maybe from a dark selfie in a mirror
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 31: VizWiz_train_00022756.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a Nokia cell phone has text in Arabic
  2. a Nokia cell phone on a desktop, with a physical keyboard
  3. a Nokia phone that is very old and dusty
  4. A Nokia phone with keyboard sitting on a surface.
  5. Nokia BlackBerry cellular telephone - possibly with a bilingual alphabet containing both English letters and possibly Arabic

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 32: VizWiz_train_00012343.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a toothbrush.

Visual question: What brand of potatoes are these?

Answers:

  1. simply potatoes mashed potatoes
  2. simply potatoes
  3. simply potatoes
  4. simply potatoes
  5. simply potatoes
  6. simply potatoes traditional mashed potatoes
  7. simply
  8. unsuitable
  9. simply potatoes
  10. simply potatoes

Reasons why answers differ:

Image captions:

  1. A hand with a gold wedding band is above a package of Simply Potatoes.
  2. a package of simply potatoes branded mashed potatoes
  3. A person is holding a package of potatoes in their hand.
  4. Quality issues are too severe to recognize visual content.
  5. Someone is touching a green and white package of potatoes.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_train_00011371.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A painting of a vase on the wall.

Visual question: Test Testing Testing

Answers:

  1. door
  2. unanswerable
  3. glass
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A frosted glass panel with a fleur-de-lis pattern in gray and the whole trimmed with a gold metal.
  2. A glass door has an etched insignia art piece on it.
  3. A screen door outside of a wooden door and some tiles on the left side.
  4. A shower door that has clear glass and a grey patterned leaves on it
  5. A stained white wall, including some tile, has a design in relief.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 34: VizWiz_train_00011840.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bedroom with a bed and a tv.

Visual question: Is it clear or dark in this room?

Answers:

  1. dark
  2. dark
  3. clear
  4. dark
  5. dark
  6. dark
  7. unsuitable
  8. dark
  9. clear
  10. dark except flash

Reasons why answers differ:

Image captions:

  1. A bed with blue checkered sheets with a desk and television in the background.
  2. a bedroom with a blue and white sheet.
  3. An blue light cobalt blue color stripe blanket, draped with a person in bed.
  4. Here is a picture of a room with a bed, desk and TV.
  5. Someone lying down on a bed covered in a blanket in a room with other blankets folded.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 35: VizWiz_train_00012263.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person is sitting on top of a bed.

Visual question: What color is this T Shirt?

Answers:

  1. white red yellow
  2. white red yellow graphic on front
  3. white
  4. white red
  5. white
  6. white
  7. white red white graphic
  8. white
  9. white
  10. white

Reasons why answers differ:

Image captions:

  1. a shirt with a graphic on the front on top of a bed
  2. a t-shirt with a design sitting on a bed
  3. a white t shirt is on top of a bed and a person is in front
  4. Two legs in front of a bed with white sheet and a white fabric with words on it in a red background.
  5. various linens and clothing with someone's lower legs next to them

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 36: VizWiz_train_00016447.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bag of food.

Visual question: Could you tell me what's in this package please?

Answers:

  1. yellow rice
  2. yellow rice
  3. yellow rice
  4. yellow rice
  5. yellow rice
  6. rice
  7. candy
  8. yellow rice
  9. yellow rice
  10. yello rice

Reasons why answers differ:

Image captions:

  1. A bag of Vigo brand yellow rice with saffron.
  2. a food item which is packed contains rice which is ready to cook
  3. A yellow package of Vigo brand yellow rice.
  4. Quality issues are too severe to recognize visual content.
  5. The front side of a small yellow package of yellow rice.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 37: VizWiz_train_00006845.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue and white hat.

Visual question: What is the brand of this shirt?

Answers:

  1. unanswerable
  2. unanswerable
  3. new balance
  4. unanswerable
  5. n
  6. unsuitable
  7. nike
  8. nautica
  9. unanswerable
  10. new balance

Reasons why answers differ:

Image captions:

  1. A New balance shoe has the laces untied and a blue label.
  2. a picture of the back of a new balance shoe.
  3. Blue, navy, and white unlaced New Balance brand sneaker
  4. the back of a black and white new balance brand shoe
  5. The back of a light and dark blue shoe.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 38: VizWiz_train_00010611.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue sky with a cloudy background.

Visual question: Please tell me what color this dress is, thank you

Answers:

  1. blue
  2. blue
  3. blue
  4. blue
  5. blue
  6. blue
  7. blue
  8. blue
  9. blue
  10. blue

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue article of clothing or cushion on a mattress.
  2. A piece of blue fabric with no discernible characteristics to it.
  3. Part of a piece of fabric with a crinkle in the bottom left corner.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 39: VizWiz_train_00015412.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a large white building.

Visual question: Are there any birds in the trees?

Answers:

  1. no
  2. unanswerable
  3. no
  4. no
  5. unsuitable
  6. no
  7. no
  8. no
  9. no
  10. no

Reasons why answers differ:

Image captions:

  1. A fence is standing up and some trees are behind it.
  2. A fence located in the backyard of a property.
  3. a fence with trees and tops of houses showing over the top of it
  4. A white fence right outside in the yard.
  5. White fence with a palm tree behind it and a pot or bucket in front of it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 40: VizWiz_train_00013161.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat that is laying down in a room.

Visual question: What kind of dog is this?

Answers:

  1. westie
  2. unsuitable
  3. yorkie
  4. terrier
  5. white terrier
  6. scottish terrier
  7. pomeranian
  8. lap dog
  9. mut
  10. terrier

Reasons why answers differ:

Image captions:

  1. a white hairy dog moving in the room
  2. I see a dog that is walking around the kitchen
  3. Partial view of a medium length coat white haired dog.
  4. small very hairy dog that is standing by wall
  5. Small, cute white dog in the corner of a room, beige walls.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 41: VizWiz_train_00013438.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a pair of scissors.

Visual question: What kind of taco is this?

Answers:

  1. unanswerable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. image plurry
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A close up of some kind of red and white food package with bright green lettering.
  2. Products that It either a food product or something different than that.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The bottom corner of packaging for an unrecognizable product.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_val_00000217.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is the person doing?

Answers:

  1. looking at computer monitor
  2. sitting at desktop facing away
  3. typing
  4. on computer
  5. working on computer
  6. they on computer
  7. working on computer
  8. watching video
  9. looking at computer
  10. looking at computer screen

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A girl is sitting at a desk looking at a computer monitor
  2. A person in a green sweater is sitting near a desk with a computer monitor and papers on top.
  3. A person wearing green shirt is sitting in front of the computer.
  4. A woman in a green shirt is sitting at a desk using a computer
  5. A woman in a green shirt is using the computer at a desk.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 43: VizWiz_train_00005076.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: can you see what this is?

Answers:

  1. unsuitable
  2. no
  3. yes
  4. unsuitable
  5. no
  6. can
  7. no label facing your palm image blurry
  8. unanswerable
  9. unanswerable
  10. yes

Reasons why answers differ:

Image captions:

  1. A can of food showing cooking instructions on the back label.
  2. A person is holding a can of food in a room with a toaster oven placed on a wooden surface and other kitchen tools on a rack.
  3. A person is holding the back of a can of food.
  4. Someone in a kitchen holds a can of food.
  5. The ingredient and nutritional information section on the back of a can of food.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_val_00004163.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a clock on a wall.

Visual question: what is this thermostat temperature set at

Answers:

  1. unsuitable
  2. i dont know
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. 78
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A small white object with buttons and a display window is displayed to one side of the picture.
  2. A white thermostat is placed on an orange toned wall.
  3. A white wall mounted thermostat with a black pasted note under the unit.
  4. Quality issues are too severe to recognize visual content.
  5. Thermostat on a tan wall in a house with no text displayed on the screen.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 45: VizWiz_train_00011440.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A piece of bread is sitting on a table.

Visual question: Does this cheese have mold?

Answers:

  1. yes
  2. no
  3. no
  4. no
  5. no
  6. no
  7. yes
  8. yes white mold at bottom cheese
  9. yes
  10. just tiny bit cut off itll be fine

Reasons why answers differ:

Image captions:

  1. A block of yellow cheese with a bite on it.
  2. A piece of cheese is laying on top of a paper towel.
  3. A SMALL SQUARE PIECE OF PARTIALLY BITTEN CHEESE.
  4. A square block of yellow cheese has a bite taken out of it and is on a white paper towel.
  5. A square of yellow cheese, with a corner ripped off lays on a paper towel

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 46: VizWiz_train_00007126.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A sign that is sitting on a table.

Visual question: Say what this is?

Answers:

  1. frozen dinner
  2. frozen meal
  3. fiesta lime chicken seasoned rice
  4. fiesta lime chicken seasoned rice
  5. fiesta lime chicken seasoned rice
  6. seasoned
  7. fiesta lime chicken seasoned rice
  8. rice mix
  9. box rice
  10. chicken rice

Reasons why answers differ:

Image captions:

  1. A cardboard box of chicken and rice is crumpled
  2. Box of fiesta lime chicken with seasoned rice
  3. In this picture is a image of a pack of rice
  4. package of fiesta lime chicken season rice in a bag
  5. Pictured is a box of grilled white meat chicken strips.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_train_00004321.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a sign with an advertisement on it.

Visual question: What is the title of this book, please?

Answers:

  1. nowa anemia
  2. nowa anemia: prezebudzenie پ wiadomoپ ci sensu پ ycia
  3. nowa anemia
  4. nowa anemia
  5. iowa siemia prezebudzenie swiadomosci sensu zycia
  6. nowa anemia
  7. nowa anemia
  8. nowa anemia
  9. nowa ziema
  10. nowa anemia

Reasons why answers differ:

Image captions:

  1. A book by Eckhart Tolle called Nowa Ziemia.
  2. A book called Nowa Ziemia by Eckhart Tolle
  3. a book that is orange and yellow blended with blue stripe at the bottom and writing
  4. An orange book called Nowa Ziemia written by Eckhart Tolle.
  5. Eckhart Tolle NOAA satellite imagery and 3D buildings and terrain.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00018580.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee sitting on top of a counter.

Visual question: Ask the question. What is this?

Answers:

  1. whole milk
  2. organic valley whole milk
  3. milk
  4. organic valley whole milk
  5. organic valley whole milk
  6. whole milk
  7. organic valley whole milk
  8. organic valley whole milk
  9. organic valley whole milk
  10. could only be organic valley whole milk

Reasons why answers differ:

Image captions:

  1. a box of organic valley brand whole milk
  2. A carton of milk on top of the tile floor
  3. A container of organic valley milk Is on a counter.
  4. A red and white carton of organic valley whole milk, placed on a tile countertop.
  5. I see a carton of organic milk and it's red.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00019775.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a car in a room.

Visual question: What is this?

Answers:

  1. kaleidascope
  2. unsuitable
  3. kaleidoscopic image
  4. geometric
  5. unanswerable
  6. kaleidoscope
  7. kaleidoscope
  8. unanswerable
  9. unsuitable
  10. crystal

Reasons why answers differ:

Image captions:

  1. A black and blue design is on a geometric screen.
  2. A unique picture of what could be a computer board or the inside of a spaceship.
  3. an abstract mirrored blue and black art image
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_val_00006348.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A hand holding the beginning of a meter
  2. a small black object that measures just under one inch long
  3. A small black rectangular piece approximately 7/8th of an inch per the yellow metal tape measure.
  4. A tape measure measuring a small black object at 7/8th of an inch.
  5. Measuring an object using yellow tape and showing less than one inch.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Showing images 0 - 0 out of 0 matching images.