Images are displayed from Training and Validation sets only.

Image 1: VizWiz_val_00006781.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A grand & toy brand fluorescent yellow highlighter
  2. A laptop is open with a highlighter on top of it.
  3. a pen which is used to highlight the important words
  4. A yellow highlighter is laying near a keyboard.
  5. image shows a highlight marker and a laptop keyboard.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 2: VizWiz_train_00010530.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a black remote control.

Visual question: What is this?

Answers:

  1. calculator
  2. calculator
  3. calculator
  4. calculator jeans
  5. part calculator jeans
  6. remote
  7. calculator
  8. calculator
  9. jean chocolate
  10. calculator

Reasons why answers differ:

Image captions:

  1. A black calculator is sitting on someone's blue jean clad lap
  2. A blank calculator resting on a person's pants leg.
  3. A calculator on the leg of someone wearing blue denim jeans
  4. A partially visible black calculator, calculator screen is illegible, jean pants under the calculator.
  5. A person wearing jeans has a calculator in their lap.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 3 / 5 annotators

Image 3: VizWiz_train_00009735.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A wall with a picture of a cat on it.

Visual question: What's the number?

Answers:

  1. 10
  2. 10
  3. 10
  4. 10
  5. 10
  6. 10
  7. 10 clubs
  8. 10
  9. 10
  10. card

Reasons why answers differ:

Image captions:

  1. A 10 of Clubs card from a standard deck of cards.
  2. A playing card featuring the number 10 and 10 Black club icons
  3. One ten of clubs card on a white surface.
  4. The 10 of clubs playing car laying on a shiny surface.
  5. The top half of a 10 of clubs card

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 4: VizWiz_train_00000672.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a mouse on a table.

Visual question: What is this?

Answers:

  1. computer mouse
  2. computer mouse
  3. mouse
  4. wire free mouse
  5. computer mouse
  6. computer mouse
  7. computer mouse
  8. mouse
  9. wireless computer mouse
  10. computer mouse

Reasons why answers differ:

Image captions:

  1. A silver and black computer mouse with a wheel.
  2. A silver computer mouse is sitting on the table.
  3. A wireless black and gray computer mouse on a tan background.
  4. A wireless computer mouse on a wood table.
  5. A wireless computer mouse with a scroll wheel and third button.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 2 / 5 annotators

Image 5: VizWiz_train_00004119.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a man in a mirror.

Visual question: Who is the guy?

Answers:

  1. chiang kai shek
  2. mexican
  3. unanswerable
  4. ho chi mein
  5. unanswerable
  6. unknown
  7. historical figure
  8. unanswerable
  9. unanswerable
  10. picture

Reasons why answers differ:

Image captions:

  1. A picture of a man in a blue button up shirt.
  2. A picture of a man wearing a blue button up shirt
  3. An image of the Chinese historical figure Sun Yat-sen
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 6: VizWiz_train_00021519.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A black stove top that is being used for cooking.
  2. A closeup showing one dial, some buttons, and part of the timer on an oven.
  3. A timer that is right on the oven.
  4. An oven knob that is set to off and multiple buttons on the oven.
  5. The knob for the left rear burner as well as oven controls for a stove are the only things visible.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 7: VizWiz_train_00014091.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue sky.

Visual question: What color is my vest?

Answers:

  1. blue
  2. blue
  3. blue
  4. blue
  5. blue
  6. blue
  7. blue
  8. blue
  9. blue
  10. blue

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a bright blue textured fabric piece with folds
  2. A knitted textured fabric is dyed a light blue.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 8: VizWiz_val_00004133.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a window with a white background.

Visual question: What is in this can?

Answers:

  1. unsuitable
  2. unsuitable
  3. unsuitable
  4. unsuitable
  5. unanswerable
  6. unsuitable
  7. unanswerable
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. I cannot Understand but It was Bright
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 1 / 5 annotators

Image 9: VizWiz_train_00016691.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a stove with a remote.

Visual question: Hey what do you think about this?

Answers:

  1. jvc makes good stuff
  2. unanswerable
  3. cassette player
  4. wefff
  5. entertainment
  6. nice
  7. old boom box
  8. unanswerable
  9. jvc cd player
  10. stove

Reasons why answers differ:

Image captions:

  1. A big white stereo with multiple operating buttons
  2. A good looking audio playing device made by JVC.
  3. A JVC radio or car stereo, grey in color, has a tape deck and a CD player along with a dialing knob at top right.
  4. A wonderful view of the fog windows in the room is very thick
  5. Front controls and display for a white JVC stereo

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 3 / 5 annotators

Image 10: VizWiz_train_00004880.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A bottle of beer sitting on the floor.

Visual question: What soda is this?

Answers:

  1. vanilla coke 0
  2. coke 0
  3. vanilla coke 0
  4. coke 0 vanilla
  5. coca cola 0
  6. coke 0
  7. coke 0
  8. coke 0
  9. coke 0 vanilla
  10. coke 0

Reasons why answers differ:

Image captions:

  1. A can of Coke Zero soft drink, showing the nutrition information panel.
  2. A can of Coke Zero vanilla showing the nutrition facts, sitting on a stove next to a wood counter
  3. A can of vanilla coke zero sitting on top of a stove
  4. Black Vanilla Coke Zero soda can on a black stove.
  5. the nutrition label of a can of vanilla Coke zero

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 11: VizWiz_train_00019897.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a wall with a mirror on it.

Visual question: What flavor is this?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A delivery notice for a package or letter, placed on a white surface.
  2. A packet of kool aid drink mix with the facts facing up.
  3. a small food packet laying with the label facing up on a white surface
  4. The back package of some sort of powdered drink.
  5. The nutritional information of a food packet can be seen laying on a white counter surface.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 12: VizWiz_val_00001575.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dog wearing a green shirt is standing in a vase.

Visual question: Whats this shirt look like, thanks?

Answers:

  1. tie dye
  2. flowers
  3. like tye dye
  4. colorful
  5. white color stains
  6. tie dye
  7. multi color white background
  8. tie dyed
  9. white multi colored forms
  10. white green pink

Reasons why answers differ:

Image captions:

  1. A crew neck tee shirt with a tie-dyed pattern.
  2. A hand holds a white sweatshirt with a pink and green pattern on a wire hanger.
  3. A tied dyed shirt hanging on a metal hanger.
  4. I see a hand holding a hanger up with a multicolored shirt.
  5. Someone holding a tie dyed t shirt on a hanger

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 13: VizWiz_train_00016095.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cake on a table.

Visual question: What is the picture on this map?

Answers:

  1. rocking horse
  2. horse
  3. horse
  4. unanswerable
  5. pony
  6. rocking horse
  7. rocking horse
  8. small horse
  9. rocking horse
  10. rocking horse

Reasons why answers differ:

Image captions:

  1. A child's pink rug with a woven depiction of a toy pony.
  2. A design of a horse is on a fluffy cushion and a apple device in the background
  3. A fluffy horse is the design on a fluffy blanket
  4. Fuzzy pink fabric with a white horse design in the middle
  5. Rug featuring a white and brown rocking horse with a blue saddle

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 14: VizWiz_train_00008817.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What kind of toppings are on this pizza?

Answers:

  1. sausage
  2. toppings sausage cheese
  3. pork
  4. sausage
  5. pork
  6. sausage
  7. sausage cheese
  8. cheese sausage
  9. sausage
  10. pork

Reasons why answers differ:

Image captions:

  1. A frozen pork sausage pizza packaged in plastic wrap with a price sticker and a picture of the pizza on the front.
  2. A package of frozen pizza with pork sausage.
  3. A pre packaged ready to cook pizza with sausage, best by 07/07/2012.
  4. A sausage and cheese pizza wrapped in plastic
  5. the front of a frozen classic sausage pizza for under two dollars

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 15: VizWiz_train_00015814.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a cell phone.

Visual question: What does this read?

Answers:

  1. 470
  2. 455
  3. 460
  4. 450
  5. 460
  6. 452
  7. 451
  8. 460
  9. 470
  10. 460

Reasons why answers differ:

Image captions:

  1. A hand holding a white ruler-type object with black letters.
  2. A scale hold by a human hand view in the image.
  3. Hand holding a white plastic device with numbers ranging from 50 to 800 with a red indication which lists the current reading at 450.
  4. Some type of barometer, labeled from to 800, is being held up to show others.
  5. Some type of white gauge with an orange indicator.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 3 / 5 annotators

Image 16: VizWiz_val_00005050.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. An object that is purple in color and has some ripples.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Various shades of pink and purple streaked horizontally.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 17: VizWiz_train_00002173.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle of food.

Visual question: What is in this can? Also, what is in this box?

Answers:

  1. vegetable soup lemon jello pudding
  2. vegetable soup lemon jello
  3. pudding in box vegetable soup in can
  4. vegetable soup jello
  5. vegetarian vegetable soup jello
  6. campbells vegetarian vegetable jello pudding mix
  7. soup jell o
  8. soup
  9. vegetable soup
  10. soup jello

Reasons why answers differ:

Image captions:

  1. A package of Jell-O and Campbell's soup along with several other food items are shown.
  2. A tin can of Campbell's soup is next to a packet of Jell-o mix on the table.
  3. A wood-stained surface with a package of Jello and a can of Campbell's vegetarian soup
  4. Grocery items of different brands on a table
  5. yellow banana pudding box and Campbell's vegetable soup among other groceries

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 18: VizWiz_train_00008489.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a car on a street.

Visual question: What is this?

Answers:

  1. keyboard
  2. keyboard
  3. keyboard
  4. keyboard
  5. keyboard
  6. computer keyboard
  7. computer keyboard
  8. this computer keyboard
  9. keyboard
  10. keyboard

Reasons why answers differ:

Image captions:

  1. A large black computer keyboard of some type
  2. A portion of a keyboard of on a black laptop.
  3. A silver colored computer keyboard is displaying a partial keypad and direction pointing keys.
  4. An up close view of a black keyboard.
  5. laptop keyboard, black in color, with white lettering on keys.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 19: VizWiz_train_00006394.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person laying on a bed.

Visual question: What color is this shirt?

Answers:

  1. white
  2. white
  3. white
  4. white
  5. white
  6. white
  7. white
  8. white
  9. white
  10. white

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A white cotton t-shirt lays on a brown sheet.
  2. A white crew neck t shirt on a bed.
  3. A white short sleeved shirt with a raised collar.
  4. a white t shirt hanging off the edge of a table
  5. A white, cotton t-shirt laying on the corner of a bed.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 20: VizWiz_train_00012653.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a hot dog on a table.

Visual question: What flavor is this?

Answers:

  1. fajita chicken rice beans
  2. fajita chicken rice beans
  3. fajita chicken rice beans
  4. fajita chicken rice beans
  5. fajita chicken
  6. fajita chicken
  7. fajita chicken soup
  8. fajita chicken rice beans
  9. fajita chicken rice beans
  10. fajita chicken rice beans

Reasons why answers differ:

Image captions:

  1. A can of chunky Campbell's fajita chicken with rice and beans soup.
  2. A hand holding up a can of fajita chicken with rice and bean can.
  3. A round tin can of Campbell's Chunky brand soup is held by a left hand.
  4. An aluminum can with red label that contains soup
  5. person holding a can of Campbell's Chunky fajita chicken and rice with beans soup with the kitchen in the background.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 21: VizWiz_train_00000087.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a wall with a white sky.

Visual question: What is my favorite dog?

Answers:

  1. unanswerable
  2. yorkie
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. dashund
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A picture of sunny sky with no other objects to mention.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 22: VizWiz_train_00016754.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cat laying on top of a microwave.

Visual question: Does it say anything?

Answers:

  1. unsuitable
  2. unsuitable
  3. unanswerable
  4. no
  5. no
  6. no
  7. no
  8. unsuitable
  9. unsuitable
  10. no

Reasons why answers differ:

Image captions:

  1. A chocolate cupcake with white frosting and red sprinkles
  2. A wooden table with a chocolate cupcake and white frosting on it
  3. looks to be a picture of a dessert with vanilla frosting.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 23: VizWiz_train_00006869.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a drink.

Visual question: What kind of can of food is this?

Answers:

  1. diced tomatoes
  2. tomatoes
  3. fire roasted diced tomatoes
  4. fire roasted diced tomatoes
  5. fire roasted tomatoes
  6. fire roasted diced tomatoes
  7. fire roasted tomatoes
  8. fire roasted diced tomatoes
  9. organic fire roasted diced tomatoes
  10. fire roasted diced tomatoes

Reasons why answers differ:

Image captions:

  1. a can of Muir Glen Brand organic fire roasted diced tomatoes
  2. A hand holding a can of fire roasted tomatoes.
  3. A person is holding a can of fire roasted tomatoes in a kitchen
  4. Can of fire roasted diced tomatoes with tomatoes on the label
  5. I see a hand holding up a can of tomatoes.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 24: VizWiz_train_00021293.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 25: VizWiz_val_00003145.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a white wall.

Visual question: Do you see any type of water bugs or anything?

Answers:

  1. no
  2. no
  3. yes
  4. no
  5. rust spot
  6. no
  7. no
  8. no
  9. yes
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A white tub, silver faucet, with black drainage.
  2. Empty, white porcelain tub with stainless steel fixtures.
  3. Quality issues are too severe to recognize visual content.
  4. The top side of a bathtub with it's spout showing
  5. white plastic bath shower combination pointed at drain

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 26: VizWiz_train_00017802.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a mountain with some rocks.

Visual question: What is this?

Answers:

  1. this mountain
  2. mountains
  3. grand canyon
  4. grand canyon
  5. canyon
  6. grand canyon
  7. grand canyon
  8. grand canyon
  9. grand canyon
  10. grand canyon

Reasons why answers differ:

Image captions:

  1. A beautiful view of a mountain, overlooking other mountains around it.
  2. A picture shows the magnitude of the Grand Canyon.
  3. a view of eroded terrain and the sky
  4. Landscape picture of mountains with very clear blue sky.
  5. Scenic view of canyon with fluffy clouds in a blue sky above it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 27: VizWiz_train_00014359.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of cake.

Visual question: The crock pot setting.

Answers:

  1. high
  2. high
  3. unsuitable
  4. on
  5. low
  6. low
  7. high
  8. low
  9. unanswerable
  10. high

Reasons why answers differ:

Image captions:

  1. a slim oval shaped knob pointing to the word low above it and the word off is to the left of the knob
  2. A slow cooker with dripped food and on and off labels
  3. a white color tuner of a white color electronic warmer
  4. Front view of a crock pot that is set to a temperature that isn't low or off.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 4 / 5 annotators

Image 28: VizWiz_train_00015365.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A piece of paper is sitting on the floor.

Visual question: What is this?

Answers:

  1. marcus theaters ticket stub
  2. ticket stub
  3. marcus theaters
  4. movie ticket to marcus theaters
  5. movie ticket stub
  6. ticket stub
  7. theatre stub
  8. ticket stub from marcus theatres
  9. movie stub
  10. marcus theaters ticket

Reasons why answers differ:

Image captions:

  1. A movie ticket is torn along a perforated edge on the side.
  2. A rip off stub from a ticketing booth is laid on the carpet.
  3. A ticket stub is lying on top of a brown towel.
  4. ripped paper ticket for Marcus Theaters with some text printed on it
  5. The back of a ticket stub sitting on carpet.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 29: VizWiz_train_00001733.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A dog that is sitting on a wooden floor.

Visual question: What color is my dog?

Answers:

  1. very cute golden brown
  2. tan
  3. light brown white
  4. light brown
  5. beige
  6. light tan
  7. red
  8. yellow
  9. red
  10. yellow golden

Reasons why answers differ:

Image captions:

  1. A brown dog laying down against the photographer
  2. A person's right hand rests on a small light brown and white dog.
  3. a reddish dog laying beside a person wearing blue pants
  4. A small blonde dog on a man's lap getting petted
  5. An elderly hand in a white sweatshirt petting a golden furred small dog whose head is on their blue sweatpants.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 30: VizWiz_val_00005009.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a bag of colmans chicken casserole that is in a package
  2. A package of casserole mix for chicken on a counter.
  3. A package of Colman's Norwich Chicken Casserole Recipe Mix
  4. A prepackaged serving of something known as chicken casserole.
  5. The front of a package of Chicken casserole mix.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 31: VizWiz_val_00007656.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A finger points to a wooden frame next to a floor.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. The inside of someone's hand covering the camera lens.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 32: VizWiz_train_00004724.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cell phone is sitting on a table.

Visual question: What is in this box?

Answers:

  1. chicken satay sauce
  2. unanswerable
  3. chicken satay sauce
  4. chicken
  5. chicken meal
  6. chicken satay sauce
  7. chicken satay sauce
  8. chicken sauce
  9. chicken satay
  10. chicken satay sauce

Reasons why answers differ:

Image captions:

  1. A box of chicken with satay sauce is on a wooden floor.
  2. A box of prepared chicken in satay sauce.
  3. A package of a chicken dish is on a wooden surface.
  4. Quality issues are too severe to recognize visual content.
  5. The container has a meal of chicken with satay sauce in it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 33: VizWiz_train_00013296.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of an empty view of a ball.

Visual question: What color are the pair of pants that I'm holding?

Answers:

  1. black
  2. grey
  3. unanswerable
  4. grey
  5. grey
  6. light brown
  7. tan
  8. tan
  9. light grey
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A dull brown piece of fabric from some cloth or piece of furniture.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Some type of grey fabric that has no stains.
  5. tan piece of cloth with a bright light shining on it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 34: VizWiz_train_00009875.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bunch of green flowers.

Visual question: What is this?

Answers:

  1. green beans
  2. reciept
  3. whole green beans
  4. green beans
  5. green beans
  6. green beans
  7. green beans
  8. green beans
  9. green beans
  10. beans

Reasons why answers differ:

Image captions:

  1. a close up of a package of green beans
  2. a container of fresh uncut green beans with the price tag
  3. An image of green beans on a wooden table is being depicted.
  4. Fresh green beans package in a clear wrap and black tray with a white barcode label.
  5. Whole fresh green beans packaged in plastic with a price label.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 5 / 5 annotators

Image 35: VizWiz_train_00007133.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blue wall with a half.

Visual question: What color is this?

Answers:

  1. white
  2. white
  3. grey
  4. blue
  5. blue
  6. white
  7. blue grey
  8. white
  9. black
  10. white

Reasons why answers differ:

Image captions:

  1. A white sheet or shirt with no other identifiable marking is seen.
  2. A wrinkled piece of fabric is being displayed.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 36: VizWiz_train_00016372.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a cloudy sky.

Visual question: what do you mean its not playing

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. unsuitable
  7. unsuitable
  8. unanswerable
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. PICTURE IS TOO BLURRY AND UNABLE TO ANALYZE IT
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by: 0 / 5 annotators

Image 37: VizWiz_val_00007100.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A angular view of the upper part of a room showing a closet shelf with colorful cushions, and a wall mirror next to the closet.
  2. A mostly empty closet with couch cushions on the top shelf
  3. A white ceiling has indentation lines on it.
  4. Quality issues are too severe to recognize visual content.
  5. The ceiling and the top of a closet with cushions on the top shelf of closet.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 38: VizWiz_val_00005007.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A ticket or booklet around the same size of a palm
  2. A person holding a card with a large iPod in the black and white picture on the card.
  3. a person holding a small piece of paper with text and an image
  4. A person's hand holding a white card with black and white photo and text on it.
  5. Black and white photo that has a mp3 player in the middle with two guitars to either side.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 39: VizWiz_train_00009697.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A cup of coffee is sitting on a table.

Visual question: What label is on this can?

Answers:

  1. unanswerable
  2. unanswerable
  3. nutrition label
  4. back label
  5. unanswerable
  6. unanswerable
  7. unanswerable
  8. unanswerable
  9. unanswerable
  10. red 1

Reasons why answers differ:

Image captions:

  1. A bowl of microwavable soup on a tan desk.
  2. A can of soup sits on a wooden table
  3. a small bowl of Campbell's brand microwavable soup
  4. an opened cup of soup with a red and white packaging showing the nutrition label
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 40: VizWiz_train_00010686.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A pair of scissors that are on a plate.

Visual question: What kind of cereal is this?

Answers:

  1. special k
  2. vanilla almond
  3. kelloggs special k
  4. unsuitable
  5. kellogg vanilla almond
  6. vanilla
  7. special k
  8. special k vanilla almond
  9. kelloggs special k
  10. special k

Reasons why answers differ:

Image captions:

  1. A box of Special K vanilla almond cereal
  2. A close up of a box of Kellogg's Special K cereal.
  3. A white cardboard box of breakfast cereal with red, blue and brown lettering.
  4. MULTI FLAVORED CEREAL INSIDE A RED WHITE AND YELLOW BOX
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00020987.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. Quality issues are too severe to recognize visual content.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 42: VizWiz_train_00016922.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a knife.

Visual question: What is this?

Answers:

  1. cherry cough drops
  2. toaster pastries
  3. unsuitable
  4. dafgv
  5. cherry toaster pastries
  6. unsuitable
  7. unsuitable
  8. toasted pastries
  9. cherry toaster pastries
  10. frosted cherry toaster pastries

Reasons why answers differ:

Image captions:

  1. 16 pack of Cherry flavor toaster pastries from Walmart
  2. A box of frosted cherry toaster pastries with cherries.
  3. A white and red box of cherry breakfast toaster pastries.
  4. box of frosted cherry toaster pastries it contains 16
  5. The front side of a cherry box with white coloring

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 43: VizWiz_train_00003641.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a white toilet with a light on it.

Visual question: Can you tell me what this is a bottle of? What is this bottle of?

Answers:

  1. suave
  2. unsuitable
  3. shampoo
  4. unsuitable
  5. suave volumizing
  6. suave volumizing moose
  7. suave volumizing
  8. shampoo
  9. suave shampoo
  10. shampoo

Reasons why answers differ:

Image captions:

  1. A bottle of shampoo against a dark background.
  2. A white bottle of Suave volumizing with a purple top.
  3. A white shampoo bottle has a purple cap
  4. Describe images taken by people who are blind
  5. It peers to be a shampoo bottle that says suave

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 44: VizWiz_train_00008766.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: Is this lemon juice or lime juice? Thank you.

Answers:

  1. lime juice
  2. lime juice
  3. lime juice
  4. lime juice
  5. lime
  6. lime
  7. lime juice
  8. lime
  9. lime
  10. lime

Reasons why answers differ:

Image captions:

  1. A green bottle of from concentrate Lime Juice
  2. Green and yellow bottle of lime juice on table.
  3. green bottle that says lime juice in English and Spanish
  4. Green container filled with lime juice showing a bright green label
  5. the front of a green bottle of lime juice

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 45: VizWiz_train_00014871.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black and white photo of a street sign.

Visual question: What's this?

Answers:

  1. parfum
  2. cologne
  3. perfume
  4. perfume
  5. rare gold perfume
  6. rare gold
  7. cologne called rare gold
  8. parfume
  9. perfume
  10. rare gold

Reasons why answers differ:

Image captions:

  1. A box of perfume called "Rare Gold" sits on top of a counter/table/desk.
  2. A box of Rare Gold perfume spray is on top of a black surface.
  3. A box with Rare Gold printed on it laying face up on a dark surface.
  4. A cardboard perfume box for Rare Gold in black with gold accents rests on a dark surface.
  5. The front of a package of "Rare Gold" perfume spray.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_val_00007406.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. An image that is too blurry to make out but shaped like a sideways rectangle.
  2. Quality issues are too severe to recognize visual content.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 47: VizWiz_train_00021676.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a bar of Ghirardelli twilight delight chocolate on a tablecloth
  2. A box of Ghirardelli chocolate called twilight delight with images of chocolate bars on the front of it.
  3. Ghirardelli dark chocolate container on top of a green rug
  4. Hand holding a box of chocolates on top of a yellow fabric.
  5. The rectangular object is by a popular brand and has the word chocolate on it.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00013133.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard on a desk.

Visual question: What is this?

Answers:

  1. hershey bar
  2. candy bar
  3. chocolate
  4. hershey bar
  5. hershey chocolate
  6. hershey bar
  7. unsuitable
  8. chocolate
  9. candy bar
  10. candy

Reasons why answers differ:

Image captions:

  1. a candy bar packaging with the word Hershey in large grey text
  2. a large package of Hershey's brand chocolate on a counter
  3. A plastic grocery bag and a brown sign placed on a white table.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00019552.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A green plate filled with broccoli on a table.

Visual question: Does this broccoli look moldy at all?

Answers:

  1. no
  2. no
  3. no
  4. no
  5. yes
  6. no
  7. maybe spots
  8. no
  9. no
  10. no

Reasons why answers differ:

Image captions:

  1. a bunch of raw broccoli florets in a bowl
  2. A fresh vegetable is on the plate and shown in this image.
  3. a green ceramic plate with green plain broccoli florets in a pile on it
  4. A plate full of raw broccoli sitting atop a wooden table.
  5. A teal plate with raw broccoli florets on it

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 50: VizWiz_train_00015863.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a tv screen.

Visual question: Can you read what's on my screen please?

Answers:

  1. key matchup
  2. yes
  3. key mathcup air force option attack unlv front 7
  4. unanswerable
  5. yes
  6. yes
  7. yes
  8. yes
  9. key matchup
  10. yes

Reasons why answers differ:

Image captions:

  1. a screen showing information about a sports match up with the military
  2. A television screen that has a program right on it.
  3. A TV screen displaying football match up of Air Force and UNLV.
  4. a TV screen with sports information about a matchup
  5. Close up of TV graphic featuring a college sports game

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators
