Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_val_00000388.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What does this say?

Answers:

  1. unsuitable image
  2. pc
  3. unknown
  4. unanswerable
  5. unsuitable
  6. cant read
  7. unsuitable image
  8. unsuitable image
  9. unsuitable image
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A computer monitor with using the Windows operating system.
  2. A computer screen with a Windows blue screen open and Vaio at the top.
  3. a windows computer setup screen asking for the time zone
  4. computer screen light blue background words in white
  5. in this image the windows computer vaio screen

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_train_00003236.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a car on a table.

Visual question: Can you tell me if there is any mold on this side of this cheese?

Answers:

  1. yes
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. no
  6. unsuitable
  7. no
  8. unanswerable
  9. yes
  10. yes

Reasons why answers differ:

Image captions:

  1. A cellophane wrapper with a yellow object inside of it.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Yellow object inside a plastic bag on top of a table.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 3: VizWiz_train_00018308.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a frisbee in a cup.

Visual question: What is this?

Answers:

  1. black beans
  2. black beans
  3. beans
  4. black beans
  5. this can black beans
  6. black beans
  7. black beans
  8. can black beans
  9. as
  10. black beans

Reasons why answers differ:

Image captions:

  1. a blue can of black beans being held by a person in their left hand
  2. A person holding a can of black beans.
  3. A white hand holds up a blue can of black beans.
  4. someone holding a can of black beans in their hand
  5. Someone holding up a blue can of whole black beans.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 4: VizWiz_train_00007092.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A red phone sitting on top of a sidewalk.

Visual question: What flavor is this?

Answers:

  1. peach mango
  2. peach mango
  3. peach mango
  4. peach mango
  5. peach mango
  6. peach mango
  7. mango
  8. peach mango
  9. peach mango
  10. peach mango hand soap

Reasons why answers differ:

Image captions:

  1. a glass bottle of peach mango HAND SOAP
  2. bottle of peach mango hand soap that contains 13 and a half fluid ounces
  3. It looks like a peach mango hand soap.
  4. Quality issues are too severe to recognize visual content.
  5. The bottom part of a soap bottle on a flat surface.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 5: VizWiz_train_00004005.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a picture of a tile floor.

Visual question: I would like to know if this picture I am sending you now goes with this shirt that I sent you a few minutes ago..

Answers:

  1. unsuitable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A carpet that is brown with beige dots on it.
  2. A closeup of some kind of fabric or material with cream colored dots and a brown background.
  3. Appears to be a piece of fabric with yellow and tan swirled pattern
  4. Floor covered in tan carpet with off white dots
  5. The fabric is grey with light spots that are in a circular pattern.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 6: VizWiz_train_00005036.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle on a table next to a book.

Visual question: What is this?

Answers:

  1. lotion
  2. body lotion
  3. lotion
  4. lotion
  5. unsuitable
  6. container
  7. unsuitable
  8. unanswerable
  9. body lotion
  10. lotion

Reasons why answers differ:

Image captions:

  1. A bottle of lotion and flat on the bed
  2. A gold plastic bottle of replenishing body lotion.
  3. A small tube of a body lotion that looks about the same size as you would get at a hotel.
  4. a small tube of replenishing body lotion laying on its side
  5. appears to be a picture of a yellow bottle

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 7: VizWiz_val_00004547.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A double CD case sitting on a person's lap
  2. A DVD with some nature scenes sitting on someone's legs
  3. A nature DVD about waterfalls and other sights is laying on someone's lap.
  4. imagine how you would describe this image on the phone to a friend.
  5. someone sitting down and there is a card on their lap

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00004349.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A living room with a chair and chairs.

Visual question: What is this?

Answers:

  1. hallway leading to room dining chairs
  2. hallway chair
  3. chair
  4. hallway
  5. entryway to room
  6. floor
  7. room chair
  8. hallway
  9. room
  10. hallway

Reasons why answers differ:

Image captions:

  1. A dining room showing a chair against the wall under a light switch.
  2. A room with tile flooring, a few chairs visible, and a light switch.
  3. A view down a hallway with an arch, tile, and a chair.
  4. An arched hallway with a view into a living room in the background with an armless dining chair as well as a portion of an armed dining chair sitting on square terra cotta tiles.
  5. The inside of a Spanish style building with a wood chair.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 9: VizWiz_val_00007260.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A glass jar containing Musselman's brand Apple Butter spread
  2. A jar of Musselman's Apple Butter on a counter.
  3. an unopened jar of Musselman's brand apple butter
  4. Jar of apple butter sauce with tomatoes on the label
  5. Musselman's Apple butter bottle with cap is on the floor.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 10: VizWiz_val_00003471.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a table.

Visual question: What is this?

Answers:

  1. orange juice
  2. juice
  3. juice
  4. orange juice
  5. juice
  6. smooth juicy
  7. smooth n juicy
  8. juice
  9. orange juice
  10. juice

Reasons why answers differ:

Image captions:

  1. a box of Smooth Juicy juice with oranges on the label
  2. A package of juice is on top of a table.
  3. Dawn brand Smooth juicy orange flavored juice box.
  4. Here is a picture of a smooth juicy in a box
  5. Juice in a blue, white, and orange carton.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_val_00006498.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A box of pasta showing a bowl of it on the package.
  2. a brown and yellow box of noodles with a graphic of prepared noodles and a red tomato on the front
  3. A container of pasta is in front of speakers for a computer
  4. Part of a box of Barilla brand pasta is shown.
  5. The upper right part of a box of pasta.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 12: VizWiz_val_00002766.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a vase with flowers.

Visual question: Is this stripes or flowers?

Answers:

  1. flowers
  2. flowers
  3. flowers
  4. flowers
  5. flowers
  6. flowers
  7. flowers
  8. flowers
  9. flowers
  10. flowers

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A blue fabric has white and green flowers on it.
  2. a blue green and white cloth with design on it
  3. A green and blue floral pattern on a type of fabric.
  4. Bright blue fabric with green and white Hawaiian floral pattern.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 13: VizWiz_train_00017031.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a green and white light.

Visual question: What's the label on this tiny bottle?

Answers:

  1. unanswerable
  2. unanswerable
  3. 0
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. 0
  8. unanswerable
  9. unsuitable
  10. this light clar

Reasons why answers differ:

Image captions:

  1. A black leather upholstered furniture with a green cloth draped on the side of it.
  2. A bowling ball that a person just bought for their league
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 14: VizWiz_val_00001026.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: what is this?

Answers:

  1. unsuitable image
  2. paper bill letter
  3. unsuitable image
  4. unanswerable
  5. unsuitable image
  6. letter
  7. letter
  8. unsuitable image
  9. paper
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A hand is holding a document up in the air.
  2. A person is holding up a document with the text facing the other direction.
  3. A wonderful view of the fog windows in the room is very thick
  4. Personal information is shown on this piece of paper.
  5. White letter sized piece of paper with black lettering.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 15: VizWiz_train_00009283.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black mouse sitting on top of a wooden table.

Visual question: What type of mouse is this?

Answers:

  1. dell wired computer mouse
  2. dell
  3. dell
  4. dell
  5. dell
  6. dell
  7. computer
  8. dell
  9. dell
  10. dell

Reasons why answers differ:

Image captions:

  1. A BLACK AND GREY MOUSE ON A DESK
  2. A black and silver Dell computer mouse sitting on a wood grain surface.
  3. A wired Dell computer mouse on a wooden surface.
  4. Black dell mouse with grey buttons on a light brown table
  5. gray and black Dell computer mouses on a wood table

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 16: VizWiz_val_00007422.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Cooking instructions and precautions for opening the product
  2. Cooking Instructions were photographed at a very close range.
  3. I see a stove top cooking with 15 min on it
  4. Quality issues are too severe to recognize visual content.
  5. The cooking instruction for a container of food meant to be boiled for 15-20 minutes or microwaved for 10 minutes.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_train_00006617.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dark up of a black sky at night.

Visual question: What color is this?

Answers:

  1. black
  2. black
  3. unsuitable
  4. black
  5. black
  6. black
  7. black
  8. black
  9. unsuitable
  10. black

Reasons why answers differ:

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 18: VizWiz_train_00023082.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. An information panel of an exercise machine has some digital displays some numbers, as well as some buttons controlling incline and speed.
  2. Quality issues are too severe to recognize visual content.
  3. The controller is black with a handle and has a UK sticker on the front.
  4. The interface of a machine with various grey buttons and digitized numbers on it along with a UK flag to one side of it
  5. Upper right part of a machine used for working out.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00022513.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A CAPTCHA prompt asking you to enter the characters jmxvmt.
  2. A close up of a computer monitor with a captcha on the screen.
  3. Computer screen asking someone to type the above characters in a box.
  4. computer screen shot of a captcha that says jmxvmt
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 20: VizWiz_train_00020557.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A Keurig K-cup of Green Mountain Coffee is on a table.
  2. a single serve shot of Green Mountain coffee
  3. Green Mountain Coffee 'Spicy Eggnog' Keurig k-cup pod.
  4. Keurig k-cup by Green Mountain Coffee on a wooden table
  5. Scratched and water-marked wooden table with unopened coffee pod on top

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 21: VizWiz_val_00004074.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person wearing a tie and a pair of shoes.

Visual question: What color are these shoes?

Answers:

  1. brown
  2. black
  3. brown
  4. brown
  5. brown
  6. brown
  7. brown
  8. foot
  9. brown
  10. briwn

Reasons why answers differ:

Image captions:

  1. a brown sandal on a foot on a wood floor
  2. A person's foot wearing a brown sandal on a wooden surface.
  3. Foot wearing dark leather sandal with white stitching.
  4. Quality issues are too severe to recognize visual content.
  5. Someone has taken a photograph of their foot, in a brown sandal, standing on a wood grained floor.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 22: VizWiz_train_00020210.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A frozen dinner meal containing spinach artichoke chicken.
  2. A person is holding a package of frozen spinach artichoke chicken.
  3. Box of Culinary Creations Spinach Artichoke Chicken sandwich pockets.
  4. Front of a box for Spinach Artichoke chicken sandwiches
  5. Rotate the box for proper viewing and focus the lens more to get a clear picture.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00009142.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cloudy sky with clouds.

Visual question: Can you tell me what this item is, please?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. page
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A white paper with blue font on top of it
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 24: VizWiz_train_00006451.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dog laying on the floor with a skateboard.

Visual question: What is this object?

Answers:

  1. dog
  2. black lab
  3. dog
  4. dog
  5. dog
  6. service dog
  7. dog
  8. dog
  9. dog
  10. dog

Reasons why answers differ:

Image captions:

  1. A black dog in a brown harness laying on the carpet.
  2. A black dog laying on the ground with a harness on.
  3. a black lab dog laying on a brown carpet
  4. A black Labrador dog with a brown harness on him, the dog is laying on the carpet.
  5. a black service dog sitting on brown and tan carpeted floor

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 25: VizWiz_val_00002712.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A yellow dog is on a table with an orange bear.

Visual question: What is this?

Answers:

  1. werfgferg
  2. coin bank
  3. piggy bank
  4. coin bank
  5. piggy bank
  6. piggy bank
  7. penny bank
  8. piggy bank
  9. piggy bank
  10. piggy bank

Reasons why answers differ:

Image captions:

  1. An animal shaped piggy bank with butterflies and orange and green stripes
  2. Ceramic pig wearing a striped sweater penny bank with slot on top.
  3. IMAGE WAS CLEAR BUT IT WAS NOT ITEM
  4. piggy bank ceramic figure with green and yellow stripes and a purple butterfly
  5. SMALL GREEN, ORANGE, BROWN AND TAN PIGGY BANK

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 26: VizWiz_train_00015976.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A woman with a cell phone on it.

Visual question: What does this say?

Answers:

  1. unanswerable
  2. unsuitable
  3. dvd
  4. unanswerable
  5. unsuitable
  6. cannot reat
  7. unanswerable
  8. not sure
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A black DVD box with a portrait of a woman on it.
  2. A box with a face on it that says DVD on it laying on a flat surface.
  3. A digital video disc box showing the front cover image and name.
  4. A DVD with an image of a blonde woman on the cover sits on a wooden table.
  5. An DVD box of an unknown title laying flat on the floor.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_train_00019746.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book is sitting on top of a table.

Visual question: What is in this box?

Answers:

  1. fice
  2. cake mix
  3. gt
  4. unsuitable
  5. apple mix
  6. tastefully simple apple mix
  7. unanswerable
  8. dessert crust
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A box of apple mix and two table mats sit on a wooden table.
  2. A box of apple pie mix lays faced-up on a dark wooden table.
  3. a dessert food mix is on a dinner table with green clothes behind it.
  4. A red box of apple crumble mix that says to just add apples and eggs.
  5. An unopened package of an apple baked good mix.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_train_00000126.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person on a table.

Visual question: What is this?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. finger
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A orange on a gray cloth with the photographer blocking half the camera
  2. Finger covers camera lens so that the photo is not visible.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00023724.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Is bright light odd?

Answers:

  1. no
  2. yes
  3. no
  4. no
  5. no
  6. unanswerable
  7. unanswerable
  8. no
  9. unanswerable
  10. no

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 30: VizWiz_train_00023365.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A cd case of country music sitting on a person's lap.
  2. A copy of a Ralph Stanley cd that still appears to be wrapped.
  3. A man has a CD in his lap.
  4. a Ralph Stanley CD case resting on someone's knees
  5. someone holding up on a cd in their lap

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 31: VizWiz_train_00023739.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What is this picture of?

Answers:

  1. can collard greens
  2. can collard greens on table
  3. collard greens
  4. tin
  5. collard greens
  6. collard greens
  7. can collard greens
  8. can collard greens
  9. collard greens
  10. collard greens

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 32: VizWiz_train_00009057.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of food next to a plate.

Visual question: What flavor yogurt is this?

Answers:

  1. peach
  2. unsuitable
  3. peaches
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. peach

Reasons why answers differ:

Image captions:

  1. A container of some sort of pudding or jello is resting on the rug.
  2. A container of yogurt is on a white and red tablecloth.
  3. A picture of the food is on the packaging.
  4. An unopened pudding containers n sitting on a striped red, white, and gray sheet.
  5. Top view of a cup of yogurt that shows the weight and flavor.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_train_00006476.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard on a table.

Visual question: What is this?

Answers:

  1. keyboard
  2. white keyboard
  3. keyboard
  4. keyboard
  5. keyboard
  6. keyboard
  7. keyboard
  8. keyboard
  9. keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. A silver and white keyed keyboard with a yellow label that says "CAT".
  2. A white or silver keyboard with a yellow sticker on it that reads CAT.
  3. silver computer keyboard with a yellow sticker on it.
  4. Silver computer keyboard with gray keys and yellow CAT sticker in the upper right corner.
  5. The center portion of a white computer keyboard with gray keys.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 34: VizWiz_train_00023094.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a computer keyboard with wires running across it
  2. A keyboard with a cable in front of it.
  3. A white keyboard with a white cord on top on a desk
  4. Quality issues are too severe to recognize visual content.
  5. White color keyboard and near a white color wire.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 35: VizWiz_train_00019836.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A book with a bunch of books on it.

Visual question: What is this? It's Italian. Read as you can.

Answers:

  1. unsuitable
  2. rio salmone
  3. unsuitable
  4. unsuitable
  5. salmone ionomoy image blurry
  6. sausage
  7. this rio
  8. salmon
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a box of frozen salmon with the nutrition label on the back
  2. a brown color Rio drug sachet on a white surface
  3. Quality issues are too severe to recognize visual content.
  4. The back of a cardboard food container listing ingredients and directions.
  5. The back of a food package with heating instructions and ingredients it is orange and showing barcode

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 36: VizWiz_train_00009175.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle on a wall.

Visual question: What type is this?

Answers:

  1. cassette tape
  2. creating health
  3. tape
  4. unanswerable
  5. cassette tape
  6. cassette tape
  7. creating health 1 cassette tape
  8. creating healthy
  9. cassette
  10. cassette tape

Reasons why answers differ:

Image captions:

  1. A cassette tape of Creating Health is sitting on top of a red fabric surface.
  2. A clear cassette tape labelled "Creating Health" and tape one.
  3. A clear cassette tape on top of a red surface.
  4. A tape-deck tape, first of a series titled creating health.
  5. A transparent VHS tape with words 'creating health' written on it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 37: VizWiz_train_00006475.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a red and orange wall.

Visual question: What color?

Answers:

  1. orange
  2. orange
  3. orange white
  4. red
  5. bright orange
  6. orange
  7. red
  8. orange red
  9. orange white
  10. orange

Reasons why answers differ:

Image captions:

  1. A bright orange terry cloth fabric is laying on a white embroidered cloth.
  2. a bright orange towel atop of a white quilted textile
  3. A bright red cloth sitting on white, patterned cloth.
  4. An orange and white comforter where the orange side is made from a softer material.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 38: VizWiz_train_00018626.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person sitting on a wall.

Visual question: What is the definition for jewish?

Answers:

  1. i see fan guitar case
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. religion
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. a standing fan and a Guitar bag beside it
  2. a standing fan in front of a window in someones room
  3. A standing fan is next to a guitar case in front of two windows with blinds
  4. The fan is sitting near the windows with the blinds.
  5. White fan on a stand in front of a window with the blinds closed.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 39: VizWiz_train_00002687.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a refrigerator on a table.

Visual question: What is he?

Answers:

  1. unanswerable
  2. no man in photo
  3. unanswerable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. can
  10. drink

Reasons why answers differ:

Image captions:

  1. a canned beverage sits on a wooden surface beside a remote control and a extension cord that has multiple plugs inserted.
  2. A canned beverage with Japanese characters on it.
  3. a cylindrical can placed on a wooden table
  4. a drink in tin container, an electrical point which is used to plug in
  5. A drinkable can of product that is brown and grey on a wooden surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 40: VizWiz_train_00015596.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a wall with a wooden door.

Visual question: What am I looking at?

Answers:

  1. wall
  2. wall
  3. wall
  4. wall part picture
  5. wall
  6. walls
  7. wall
  8. unsuitable
  9. wall
  10. wall framed art

Reasons why answers differ:

Image captions:

  1. A partial image of what may the corner of a picture frame on a wall.
  2. Edge of a wooden frame on a white wall
  3. Not a good image and image has drawbacks in quality.
  4. Quality issues are too severe to recognize visual content.
  5. Wooden object in the right hand corner in front of a white wall.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 41: VizWiz_train_00013893.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a book in front of a computer.

Visual question: How much of this page is visible and how skewed is the text? Thank you.

Answers:

  1. about 4 5 page visable text isnt skewed
  2. 80% slightly
  3. almost all text blurry
  4. 3 4 not very slanted
  5. almost all
  6. bottom 75 percent
  7. few lines on top missing blurry far away
  8. partial page
  9. mostly visible
  10. maybe half but i am still unable to read

Reasons why answers differ:

Image captions:

  1. A book with several pages and words inside of it.
  2. a textbook or book that you can read that has text on it
  3. Middle part of a book being held by someone against a table.
  4. Pictured is a hand holding open a book.
  5. The left side page of an opened novel with a red cover.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_val_00004379.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A zoom in of a red object against a dark object
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Whitish pink background without any other object; appears overexposed.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 43: VizWiz_train_00006272.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A toilet with a hat on top of it.

Visual question: What is this item?

Answers:

  1. conditioner
  2. shampoo
  3. conditioner
  4. pantene pro v conditioner
  5. soap
  6. conditioner
  7. shampoo
  8. pantene pro v
  9. conditioner
  10. lotion

Reasons why answers differ:

Image captions:

  1. A bottle of Pantene conditioner laying on a dark surface
  2. a cream colored plastic container of Pantene Pro-V conditioner
  3. A dented bottle of Pantene conditioner lies on a dark surface near a Christmas ribbon.
  4. A gold and green ribbon by Pantene conditioner.
  5. a large bottle of Pantene brand moisture renewal conditioner

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 44: VizWiz_train_00020466.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A lit living room with entertainment center in the corner, children's toys and pillows scattered across the floor, and a sliding glass door to the left.
  2. a messy floor covered in kids toys and pillows
  3. A toddler's colorful riding car in a living room of toys.
  4. Floor toys are laid out on a carpet
  5. I can tell this room is where children come to play based on the placement of the toys.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 45: VizWiz_train_00007345.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a book on a table.

Visual question: What page number is on this page?

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. 9
  8. unanswerable
  9. unanswerable
  10. 9

Reasons why answers differ:

Image captions:

  1. A book open on the table shown by the picture.
  2. a page in the middle of text book
  3. chapter nine in a book about languages in contact
  4. IMAGE WAS UNCLEAR BUT IT IS NOT ITEM
  5. Language is the subject for this educational looking book.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 46: VizWiz_val_00004586.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A grey color material in which some white dots.
  2. An off white textured fabric with raised bumps on it.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The surface of a blue carpet with vertical lines.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 47: VizWiz_train_00000445.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cat on a bed.

Visual question: Which one of these is the sugar bag?

Answers:

  1. bag in front cigarettes
  2. true
  3. white 1
  4. unanswerable
  5. unsuitable
  6. left
  7. white
  8. front bag
  9. only 1
  10. 0

Reasons why answers differ:

Image captions:

  1. A large bag of chocolates is placed in front of an ashtray for cigarettes.
  2. A large, open bag of granulated sugar on a kitchen countertop, next to an ashtray that is half-full of cigarette butts.
  3. an opened package of sugar on a counter top with an ashtray full of cigarette butts, and a pot
  4. Side of a white bag with food label showing in cigarettes in the background
  5. The side of a bag of sugar and an ashtray.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00016724.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A remote control sitting on top of a table.

Visual question: What color is this?

Answers:

  1. black
  2. black grey
  3. black
  4. black
  5. black
  6. black
  7. black
  8. black silver
  9. black
  10. black

Reasons why answers differ:

Image captions:

  1. a black corded telephone on top of a hard white surface
  2. A standard wired telephone is in the image, with the handset on the left and the buttons and display on the right, The text in the image shows call information.
  3. A traditional style telephone with a digital display.
  4. Black and silver polycom 20 plus key with digital screen with caller ID
  5. Black Polycom IP phone in front of a heat register.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00004560.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person wearing a pink shirt.

Visual question: Can you tell me what shade of pink this fabric is?

Answers:

  1. light pink
  2. suttle hot pink
  3. light pink
  4. hot pink
  5. soft pink
  6. neon
  7. cotton candy pink
  8. hot pink
  9. coral
  10. pastel pink

Reasons why answers differ:

Image captions:

  1. A fancy embroidered edge surrounds a pink shirt or blanket.
  2. a pink piece of fabric with minimal creasing
  3. Pink fabric that is being used to make a dress
  4. Some bright pink material with dark purple thread along the seams.
  5. the corner of a neon pink cloth unknown item

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_train_00007858.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wooden board on a table.

Visual question: What's in this pen please? Thanks.

Answers:

  1. unanswerable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unsuitable
  6. unanswerable
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A can of food with a barcode on the label and a piece of wood with some text on it are shown in the image.
  2. A can with a white label is next to a box.
  3. A canned food good of some kind on its side near a wooden box.
  4. A tin can of some kind of product on its side in front of a wood box.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Showing images 0 - 0 out of 0 matching images.