Showing images 0 - 0 out of 0 matching images.

Images are displayed from Training and Validation sets only.
Hover over image to zoom in.

Image 1: VizWiz_train_00015899.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bag of food.

Visual question: What is this?

Answers:

  1. chicken breast meat for fajitas
  2. chicken breast meat for fajitas
  3. chicken breast meat for fajitas
  4. chicken breast fajita meat
  5. chicken
  6. chicken breast meat for fajitas
  7. chicken breast meat for fajitas
  8. chicken breast meat for fajitas
  9. chicken breast meat
  10. chicken breast boneless rib meat for fajitas

Reasons why answers differ:

Image captions:

  1. a 28 oz bag of frozen chicken breast for fajitas
  2. A package of LiveSmart Chicken Breast Meat for Fajitas
  3. A plastic bag with various food items for cooking
  4. A two pound bag of chicken breast meat for fajitas
  5. Big bag of chicken breast usually used for tacos.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 2: VizWiz_train_00008382.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle on a table.

Visual question: What is this?

Answers:

  1. unsuitable
  2. medicine
  3. hair dye
  4. sdfsad
  5. unanswerable
  6. hair dye
  7. pureology
  8. bottle eye wash
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A bottle of hair care product that is good for colored hair
  2. A pink bottle of cream is laid on a black surface, the brand displayed on the bottle is Pureology.
  3. a white bottle with purple lid on top of a wooden floor
  4. A white plastic bottle with a purple lid, brand pureology.
  5. The top of a hair product bottle that is on a hardwood surface.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 3: VizWiz_train_00023437.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What do you see in this photograph in detail. It's a room and i want to know the placement of objects.

Answers:

  1. unanswerable
  2. partitions
  3. office cubicles
  4. green plant large windows
  5. office cubicles
  6. cubicles in office potted plant on file drawer
  7. envelope pinned on wall directly in front cubicles to either side plant on file cabinet in middle
  8. cubicles filing cabinet ahead windows to left
  9. office
  10. plants on desk

This image does not have annotations for Reasons Why Answers Differ.

This image does not have annotations for Captions.

This image does not have annotations for Skills.

This image does not have annotations for Quality Issues.

This image does not have annotations for Text Presence.

Image 4: VizWiz_val_00003898.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A living room with a couch and a window.

Visual question: What color are these pants?

Answers:

  1. red
  2. unsuitable
  3. blue
  4. blue
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. blue
  10. black

Reasons why answers differ:

Image captions:

  1. A couch in front of a door and window with a mini-blind covering it.
  2. A sofa is in front of a white door.
  3. Quality issues are too severe to recognize visual content.
  4. The front door and window of someone's home.
  5. White door to the left, window with blinds to the right a red chair at the forefront.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 5: VizWiz_val_00004215.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a colorful colored bird.

Visual question: What is this?

Answers:

  1. i do no
  2. wristband
  3. knitting
  4. sock
  5. unanswerable
  6. unanswerable
  7. sock
  8. sock
  9. sock
  10. sock

Reasons why answers differ:

Image captions:

  1. A knit item that is similar to a scarf is colored teal, yellow and two light greens.
  2. A knitted or crocheted item is lying on someone's knee.
  3. Blue and yellow striped knitted thing laying across someone.
  4. it appears to be a knitting blue and yellow sock maybe
  5. The band is crocheted and is bright blue and lime green in color.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 6: VizWiz_train_00019678.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a black meter.

Visual question: Can you see the brand and/or model of this CD player at this angle?

Answers:

  1. no
  2. no
  3. no
  4. unsuitable
  5. unsuitable
  6. no
  7. no
  8. no
  9. no
  10. no

Reasons why answers differ:

Image captions:

  1. A white Cd Roms disc player sitting a surface.
  2. An old large portable cd player and AM/FM radio .
  3. I see the top view of a white CD player.
  4. some type of black and white electronic device
  5. the top of a portable radio boombox with CD player

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 7: VizWiz_train_00021342.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A box of minute maid premium orange juice
  2. a carton of Minute Maid Heart Wise orange juice
  3. A carton of Minute Maid Heart Wise orange juice.
  4. Carton of Minute Maid Premium Heart Wise Orange Juice.
  5. The jug of orange juice is against the wall.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 8: VizWiz_train_00014775.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A table with two plates of food on it.

Visual question: What is it?

Answers:

  1. soup
  2. food
  3. noodles tomato sauce
  4. food
  5. food
  6. looks like mexican food
  7. food
  8. soup bread
  9. chinese meal
  10. asian inspired meal for 2

Reasons why answers differ:

Image captions:

  1. A tray is holding several plates with what looks to be a half eaten meal.
  2. a tray of a half eaten meal containing multiple dishes
  3. A tray of fast food maybe Indian there is naan bread and a red soupy substance in a black bowl
  4. A tray with various bowls and plates that contain food, food remnants, or dipping sauce.
  5. image quality is high to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 9: VizWiz_train_00010011.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A view of a person on a chair.

Visual question: What's in this image?

Answers:

  1. pipes
  2. subway station
  3. office
  4. unsuitable
  5. unanswerable
  6. room
  7. unsuitable
  8. basement
  9. column
  10. poles

Reasons why answers differ:

Image captions:

  1. a room with big black poles with yellow tops and seats in it
  2. a sideways picture of some sort of inside structure
  3. An indoor scene showing several benches positioned back-to-back, next to floor-to-ceiling pillars.
  4. Lobby with large screen and black and yellow poles.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 10: VizWiz_train_00014656.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a book on a table.

Visual question: Okay, let's try this again. What is the brand name of the lasagna?

Answers:

  1. unanswerable
  2. food
  3. unanswerable
  4. unanswerable
  5. unanswerable
  6. meat 4 cheese
  7. unsuitable
  8. meat lasagna 4 cheeses
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A corner picture of a meat lasagna 4 cheese dinner box.
  2. A frozen meat lasagna meal with cheese on a blue package.
  3. A package of frozen meat lasagna is on a table.
  4. A picture of a box of meat lasagna with four cheeses.
  5. frozen boxed dinner sets on a wooden cutting board next to a white stove

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 11: VizWiz_val_00002587.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person standing on a couch with a remote.

Visual question: Does this stack look too big?

Answers:

  1. i dont see anything stacked
  2. unanswerable
  3. unanswerable
  4. no
  5. unanswerable
  6. unanswerable
  7. no
  8. no
  9. no
  10. gfd

Reasons why answers differ:

Image captions:

  1. A person in jeans and plaid shirt standing in front of a counter and another person's leg sticking off a couch.
  2. A person is sitting on a couch with their leg out towards a standing person.
  3. A person standing up facing away wearing blue jeans with a white and plaid shirt.
  4. Man with jeans standing visible from the middle back down..
  5. Somebody pointing their foot at the back of someone else's leg.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 12: VizWiz_train_00006191.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a laptop on a desk.

Visual question: What is it?

Answers:

  1. table
  2. kinect
  3. kinect
  4. table
  5. desktop
  6. floor
  7. table
  8. wireless sensor
  9. unsuitable
  10. desk

Reasons why answers differ:

Image captions:

  1. A cable that leads to a DVD player and that has the word Kinect on it.
  2. A Kinect device is shown next to a wooden flat surface.
  3. A portion of a Kinect gaming accessory is shown on a desktop.
  4. the bottom of a monitor resting on a wooden desk
  5. The front of a motion sensing electronic device beneath a television monitor.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 13: VizWiz_val_00000528.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: Can you see actually the whole screen now and did it actually change when I hit enter?

Answers:

  1. no
  2. no dont know
  3. unanswerable
  4. no
  5. unanswerable
  6. whole screen now
  7. user license agreement
  8. unanswerable
  9. yes on license agreement
  10. unanswerable

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a close up picture of a computer screen of a set up window showing
  2. a laptop computer showing end user license agreement
  3. A laptop is open and there is a licensing agreement.
  4. A laptop screen displaying a user license agreement.
  5. it's a windows vista user agreement box for setting up windows

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 14: VizWiz_train_00016085.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup on a table.

Visual question: What is this can?

Answers:

  1. food
  2. spaghetti
  3. spaghetti os
  4. spaghetti os
  5. spaghettios
  6. spaghetti os
  7. spaghettios
  8. spaghetti os
  9. spaghettios
  10. spaghettios

Reasons why answers differ:

Image captions:

  1. A hand holding a can of Spaghetti O's in the kitchen.
  2. A hand holding an aluminum can of spaghettio's
  3. A partial image of a can that probably contains spaghetti.
  4. A person is holding up a can of pasta.
  5. A tin can of Campbell's spaghetti is held in a hand.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 15: VizWiz_train_00016827.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A refrigerator that is sitting in a kitchen.

Visual question: What are the instructions on the box?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unanswerable
  5. paper
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. baking
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. Back of a Nestle cardboard box showing the nutritional facts and there are a bunch of kitchen items in the background
  2. table with different bottles may be related to detergent or wine
  3. The back of a box of food in a kitchen with paper towels and a toaster in the background.
  4. The back of a box of instant ready to heat food.
  5. the back of a package of food showing nutritional information

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 16: VizWiz_val_00006837.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A book with written texts on the cover page that is sitting on time a table.
  2. A close up of a page of a textbook in a foreign language.
  3. A cover page for a compilation of files
  4. Beginning page of a book or packet of some sort, like a title page, not written in English.
  5. White paper page with big and small black font lettering on it

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 17: VizWiz_train_00007227.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person sitting on a bench.

Visual question: Tell me what the charcoals look like. Are they ready to cook yet?

Answers:

  1. no
  2. black small flames yes
  3. yes
  4. mostly black not ready yet
  5. black white no
  6. black some grey not yet
  7. no
  8. no
  9. black
  10. barely white not ready yet

Reasons why answers differ:

Image captions:

  1. A charcoal grill with lit burning charcoal inside.
  2. A person is standing in front of a grill with burning charcoal briquettes inside of it.
  3. A set of feet standing beneath the grate of a charcoal fire pit that is slowly burning out.
  4. A view of someone's grill outside with the grill being lit.
  5. Quality issues are too severe to recognize visual content.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 18: VizWiz_train_00022199.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle of an alcoholic drink being held by someone.
  2. A glass bottle filled with orange flavored beverage
  3. A person's hand is holding a bottle of DeKuyper triple sec.
  4. A wonderful view of the fog windows in the room is very thick
  5. Hand holding a bottle of DeKuyper orange alcohol

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 19: VizWiz_train_00008209.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a red and black frisbee.

Visual question: Can you please tell me what it says on this CD, what does it say on this CD?

Answers:

  1. writing not visible
  2. unanswerable
  3. not readable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unsuitable
  9. unanswerable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. a blurry close up of a cd the type is indiscernible
  2. A button in an elevator with a red ring around it.
  3. A circular object is shown, with a bright light emitting an aura in the background.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 20: VizWiz_train_00014010.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A person holding a blue phone with a mouse.

Visual question: What is this CD?

Answers:

  1. unsuitable
  2. unanswerable
  3. blank
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. memorex cd r
  8. unsuitable
  9. read only cd
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A hand is holding a CD in a purple sleeve holder.
  2. A hand is holding a CD-R in a blue plastic case.
  3. A person holding a blue plastic CD case in their left hand.
  4. Blue plastic packaging for writable DVD with computer in background
  5. Pictured is a computer cd inside of a blue sleeve.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:2 / 5 annotators

Image 21: VizWiz_val_00003397.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a bottle on a counter.

Visual question: What is in this spray bottle?

Answers:

  1. unsuitable
  2. unanswerable
  3. unsuitable
  4. unanswerable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. unsuitable
  9. unsuitable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A brown counter with a white spray bottle on top
  2. A cleaning bottle of some sort laying flat with the directions on the back facing up.
  3. A white spray bottle with instruction on the back of it.
  4. some type of liquid that is in a container
  5. WHITE SPRAY BOTTLE PLACED ON A BROWN RUG

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 22: VizWiz_val_00003130.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cake on a table.

Visual question: What is this?

Answers:

  1. unanswerable
  2. penbino
  3. game
  4. penbino
  5. penbino
  6. penbino
  7. toy
  8. book
  9. unanswerable
  10. toy

Reasons why answers differ:

Image captions:

  1. A box of what seems to be a board game, with a picture of a large, pink penguin on a grassy land with a brown fence behind it on display.
  2. A children's toy in a box, with a cartoon penguin on it.
  3. A pink and white penguin graphic on a child's boxed toy
  4. A pink penguin on a box sitting sideways.
  5. a stuffed animal box for a pink penguin.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 23: VizWiz_train_00011242.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up view of a person that is shown.

Visual question: How color?

Answers:

  1. dark brown
  2. blonde
  3. brown
  4. silver
  5. dark brown
  6. dark brown
  7. black
  8. dark brown
  9. silver
  10. black

Reasons why answers differ:

Image captions:

  1. A close up of brown fur or hair that covers the entire frame.
  2. Here is a photo of the top of somebody's head that has shorter darker brown hair.
  3. In this photo is the back of someone's head with black hair.
  4. someone hair on the back of their head its black
  5. the back of someone's head who has short black hair

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 24: VizWiz_train_00004237.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A picture of a vase with flowers on it.

Visual question: Hello,web workers just like to check the picture on this card please.

Answers:

  1. birthday card
  2. have lovely birthday
  3. teacup flowers
  4. tea cup saucer flower cupcakes
  5. birthday card
  6. happy birthday
  7. birthday card
  8. teapot 2 cupcakes
  9. teacup
  10. teacup cupcakes

Reasons why answers differ:

Image captions:

  1. A birthday card with a teacup, flowers, and cupcakes on it that says have a lovely birthday.
  2. A white birthday card with flowers and a tea cup on the front.
  3. Beautiful view from behind the walls hidden under dark mist
  4. I see a birthday card on the table
  5. the front of a birthday card that reads "have a lovely birthday"

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 25: VizWiz_val_00003301.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a cup of coffee on a table.

Visual question: Could you tell me what kind of chips these are please

Answers:

  1. no
  2. regular
  3. classic
  4. lays plain
  5. classic
  6. classic
  7. unanswerable
  8. lays
  9. lays classic
  10. lays classic chips

Reasons why answers differ:

Image captions:

  1. A bag of Lay's brand classic potato chips.
  2. A package of potato chips is ready to be eaten,
  3. A yellow red and white bag of Lay's chips is unopened.
  4. Part of a yellow and red Lays classic potato chip bag.
  5. The chips appears to be Lays potato chips classic in the picture.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 26: VizWiz_val_00005352.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a can of extra creamy dairy whipped topping sitting on a desk
  2. a can of extra creamy whipped cream with a white and purple label and a blue top
  3. a very large can of extra creamy whipped topping on a counter
  4. a white bottle of Extra Creamy dairy Whipping
  5. An aerosol container of cream whip is on someone's desk

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 27: VizWiz_val_00006940.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A bottle of chili condiment in a plastic flash.
  2. A container of paprika spice is held in a kitchen.
  3. Quality issues are too severe to recognize visual content.
  4. someone is holding a white 2 oz bottle of paprika
  5. white container of pimentos that looks like liquid in someone's hand, chair in the background

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 28: VizWiz_train_00014982.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a refrigerator in a room.

Visual question: Try this again. Can you find a key code for Microsoft Office 2010 Professional?

Answers:

  1. no key code
  2. no
  3. unsuitable
  4. unsuitable
  5. unanswerable
  6. unanswerable
  7. unsuitable
  8. no
  9. unsuitable
  10. no

Reasons why answers differ:

Image captions:

  1. a fridge of some sort that is very stainless steel
  2. A large white wall partially obscured in shadow.
  3. A white refrigerator is bathing in the sunlight from the window.
  4. Quality issues are too severe to recognize visual content.
  5. white with a little tan on the doors could be cabinet or closet

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:0 / 5 annotators

Image 29: VizWiz_train_00003775.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a box of paper.

Visual question: What is this for?

Answers:

  1. tissues
  2. face tissue
  3. tissues
  4. tissue
  5. tissues
  6. runny nose
  7. blowing nose
  8. blowing nose
  9. wiping noses
  10. tissue

Reasons why answers differ:

Image captions:

  1. A black box with tissues is on the bed
  2. A box of tissues is seen with a tissue coming outside of the box.
  3. A grey box of Puff tissues sitting on a fabric surface.
  4. An box of Kleenex on top o an blanket of wool cotton.
  5. appears to be a picture of a box of tissues

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 30: VizWiz_train_00003693.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: There is a sign on the side of a car.

Visual question: What is this?

Answers:

  1. dish soap
  2. dishsoap
  3. palmoliveoxy plus dish detergent
  4. palmolive oxy power degreaser dish detergent
  5. palmolive dish soap
  6. dish soap
  7. dish soap
  8. palmolive oxy plus dishwashing liquid
  9. dish washing soap
  10. dish soap

Reasons why answers differ:

Image captions:

  1. A bottle of degreaser is on top of a table.
  2. a bottle of Palmolive OXY Plus power degreaser
  3. For Palmolive oxy plus company product is shown by here.
  4. Palmolive oxy plus degreaser on a tile surface.
  5. the back of a Palmolive Oxy power degreaser dish soap

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 31: VizWiz_val_00005778.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. -This a yellow box with a bag in it
    -The box is on the counter in front of a kitchen sink
    -The box says Maizena
    -The text is in another language
  2. A box of corn starch called Maizena it is yellow and on the counter by the sink
  3. A small box of flour is opened in front of the sink.
  4. Open yellow box of Maizena cereal on a counter in front of a sink and faucet.
  5. Some kind of boxed cooking mix, box is yellow with black writing that is not English.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 32: VizWiz_train_00013123.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a plate of food.

Visual question: What is in this box?

Answers:

  1. chicken dijon
  2. chicken dijon
  3. chicken dijon
  4. chicken dijon
  5. chicken
  6. chicken dijon
  7. chicken dijon
  8. chicken dijon
  9. chicken dijon
  10. food

Reasons why answers differ:

Image captions:

  1. A container of food has a picture of the food
  2. A placemat with a TV dinner that says "Chicken Dijon" on it
  3. A small box for a TV dinner sits on the table with the image of pasta and chicken on the front of it.
  4. Packaged food box with chicken and rice illustration on placemat.
  5. Pictured is a microwavable Chicken Dijon meal sitting on top of a yellow woven pot holder.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 33: VizWiz_train_00021890.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. a close up shot of a box of jello gelatin
  2. a picture of an yellow and blue box
  3. A small box of orange sugar free Jell-O.
  4. blue, red, and orange box of sugar free Jello sitting on a brown surface
  5. Orange jello was a family favorite when we were younger.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 34: VizWiz_train_00001842.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a keyboard on a wooden table.

Visual question: What is this, and what color is it?

Answers:

  1. silver computer keyboard
  2. white keyboard
  3. keyboard
  4. keyboard white silver
  5. keyboard grey
  6. keyboard silver
  7. keyboard grey
  8. grey keyboard
  9. keyboard white
  10. keyboard silver

Reasons why answers differ:

Image captions:

  1. a mac desktop keyboard sitting on a wooden table/desk
  2. A silver and white qwerty keyboard sitting on a wooden surface.
  3. A silvery keyboard lies on top of a wooden table.
  4. computer keyboard with letters, resting on wooden surface.
  5. Here is a picture of someone sitting at a desk with a keyboard.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 35: VizWiz_val_00001733.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a person holding a toothbrush.

Visual question: Can you tell what flavor this is?

Answers:

  1. unsuitable
  2. orange
  3. unsuitable
  4. unsuitable
  5. tangerine
  6. orange
  7. tangerine
  8. grapefruit
  9. unanswerable
  10. unanswerable

Reasons why answers differ:

Image captions:

  1. A box of Crystal Lege containing 10 individual sachets of grapefruit and orange flavor.
  2. A box of flavored water mix in individual packs
  3. A container that holds 10 individual packets of juice in it, there is a sliced orange on the container.
  4. a package of ten Cristal Leger tangerine water flavoring packets
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 36: VizWiz_train_00018115.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a yellow plate on a table.

Visual question: What is this?

Answers:

  1. smart balance
  2. smart balance butter
  3. butter
  4. smart balance
  5. smart balance spread
  6. smart balance spread
  7. smart balance spread
  8. smart balance
  9. margarine
  10. smart balance

Reasons why answers differ:

Image captions:

  1. A circular cap with inner yellow circle showing label in hands.
  2. A finger holding a lid above a towel.
  3. A tub of smart balance butter is held up.
  4. the top of a smart balance spread tub being held so that we can read the top
  5. Yellow and white top of a Smart Balance container with blue lettering.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 37: VizWiz_train_00005321.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blender in a kitchen.

Visual question: What is this?

Answers:

  1. perfume bottle
  2. perfume
  3. bottle perfume
  4. perfume
  5. unsuitable
  6. perfume
  7. perfume
  8. unsuitable
  9. perfume
  10. perfume

Reasons why answers differ:

Image captions:

  1. A clear bottle of liquor with a golden outlined label and cap.
  2. A clear glass bottle of perfume stands on a white surface.
  3. An empty glass perfume bottle with a white label edged in gold.
  4. Glass bottle that looks like it could be either a perfume bottle or a liquor bottle but is too hard to tell because it is blurry.
  5. glass perfume bottle containing pink liquid with gold details around the label and the cap.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 38: VizWiz_train_00014160.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A white plate sitting on top of a table.

Visual question: How is this image? Is it sufficient to get the expiration date?

Answers:

  1. no
  2. unsuitable
  3. unsuitable
  4. no bring closer to can
  5. no
  6. unsuitable
  7. unsuitable
  8. no
  9. need direct shot top
  10. no too angled

Reasons why answers differ:

Image captions:

  1. a can on the table with remote controller
  2. Quality issues are too severe to recognize visual content.
  3. Top of a can of food on a table, but image is too blurry to read.
  4. Two chairs are by a table with food products
  5. Various items sit on a wooden table with wooden chairs around it.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 39: VizWiz_train_00020644.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A ball on the floor and also some plugs.
  2. Carpeted floor and desk ball in front of a cross-legged individual.
  3. Persons legs sitting in a room with equipment and electrical wires, a grey exercise ball, and a plastic drawer set.
  4. Quality issues are too severe to recognize visual content.
  5. The floor of a carpet room, with a large exercise ball, plastic dresser, and other items visible.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 40: VizWiz_train_00000500.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a sign on a wall.

Visual question: What is this item?

Answers:

  1. sage
  2. ground sage
  3. bottle ground sage
  4. sage ground
  5. sage
  6. sage ground spice
  7. ground sage
  8. sage ground
  9. ground sage
  10. sage

Reasons why answers differ:

Image captions:

  1. A 5 ounce bottle of ground sage powdered spice.
  2. A container of ground sage with a red and yellow label
  3. A wonderful view of the fog windows in the room is very thick
  4. An unopened pack of sage ground seasoning made by Marshalls Creek Spices.
  5. close up of a bottle of sage seasonings.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 41: VizWiz_train_00022375.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A metallic plate with the Marches from MacDonald next to a menu showing an image of a chicken sandwich.
  2. A paper with a burger sandwich image talking about Mcdonalds tender grilled chicken thigh.
  3. A picture of a sandwich shows a bun and lettuce.
  4. a yellow and pink color paper showing some info
  5. Directions for chicken are shown on this packaging.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 42: VizWiz_train_00014880.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A black dog sitting on top of a couch.

Visual question: When is the, when does this expire?

Answers:

  1. unanswerable
  2. unanswerable
  3. unanswerable
  4. unsuitable
  5. unsuitable
  6. unsuitable
  7. unsuitable
  8. unanswerable
  9. unsuitable
  10. unsuitable

Reasons why answers differ:

Image captions:

  1. A package that is held up that might be orange in color.
  2. A person's legs in black pants sitting in a room with green carpet at a desk.
  3. Quality issues are too severe to recognize visual content.
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 43: VizWiz_train_00023339.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A cell phone with a red case plugged in to a white cord.
  2. A magenta colored cell phone sitting on a grey table plugged into a pink and white charging cord.
  3. A phone in a pink cover is plugged into the wall.
  4. a smartphone in a pink carrying case with a white USB cable plugged in
  5. Pink leather phone case with curved lives in it

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:1 / 5 annotators

Image 44: VizWiz_val_00000042.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

Visual question: What does this fortune cookie say?

Answers:

  1. unsuitable image
  2. unsuitable image
  3. too blurry
  4. unsuitable image
  5. unsuitable image
  6. unsuitable image
  7. unsuitable image
  8. unsuitable image
  9. unanswerable
  10. unsuitable image

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A close up of a sheet of paper on a wooden desk.
  2. A piece of paper is on the brown table but it is very blurry.
  3. A white sheet of paper has blurry text on it
  4. Quality issues are too severe to recognize visual content.
  5. Quality issues are too severe to recognize visual content.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:4 / 5 annotators

Image 45: VizWiz_train_00020240.jpg

A photo captured by a person who is blind. A caption for this photo is currently not available.

This image does not have an associated Visual Question with Answers.

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A green and white tube of cucumber melon hand gel.
  2. A small container of hand gel sits on a light wooden table.
  3. A tube of cucumber melon hand restoring gel on top of a wooden table.
  4. A white and green tube of cucumber melon hand gel.
  5. antibacterial hand gel with a touch of cucumber to soften hands.

This image does not have annotations for Skills.

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 46: VizWiz_train_00007552.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A box of food sitting on top of a table.

Visual question: What kind of cheese is this?

Answers:

  1. sharp cheddar shredded
  2. fancy shredded sharp cheddar
  3. sharp cheddar
  4. sharp cheddar
  5. shredded sharp cheddar
  6. sharp
  7. sharp cheddar
  8. shredded sharp cheddar
  9. sharp cheddar
  10. shredded sharp cheddar

Reasons why answers differ:

Image captions:

  1. A bag of fancy sharp cheddar cheese is on a counter.
  2. A bag of sharp shredded cheddar cheese sitting on a kitchen countertop
  3. A bag of shredded sharp cheddar cheese from HEB
  4. a picture of a sharp cheddar cheese package.
  5. Black bag of HEB fancy shredded cheddar cheese

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 47: VizWiz_train_00001184.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blender on a table.

Visual question: What is this?

Answers:

  1. canned food
  2. cans
  3. unsuitable
  4. can food
  5. juice
  6. unsuitable
  7. unanswerable
  8. cans
  9. cans
  10. food

Reasons why answers differ:

Image captions:

  1. Containers of food, but they are too out of frame to read the labels.
  2. Quality issues are too severe to recognize visual content.
  3. Two aluminum cans, one green and one orange
  4. Two tin cans have images of food on them
  5. Two tin cans sitting on a wood surface next to each other.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 48: VizWiz_train_00009139.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a container of food.

Visual question: What is this?

Answers:

  1. vegetable blend
  2. veggie
  3. lemon garlic seasoned vegetable blend
  4. vegetables
  5. lemon garlic seasoned vegetable blend
  6. lemon garlic seasoned vegetable blend
  7. lemon garlic seasoned vegetable blend
  8. vegetable blend
  9. lemon garlic seasoned vegetable blend
  10. vegetables

Reasons why answers differ:

Image captions:

  1. A bag of frozen assorted vegetables with a see through label.
  2. A bag of frozen lemon garlic seasoned vegetables.
  3. A partial image of a package that contains seasoned vegetables.
  4. a red and white bag of lemon garlic seasoned vegetables on top of a counter
  5. Live Smart bag of Lemon Garlic Seasoned Vegetables.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:5 / 5 annotators

Image 49: VizWiz_train_00007742.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a red and white sign.

Visual question: Can you tell me what size this is?

Answers:

  1. xxl
  2. xxl
  3. xxl
  4. xxl
  5. xxl
  6. xxl
  7. xxl
  8. xxl
  9. xxl
  10. xxl

This image does not have annotations for Reasons Why Answers Differ.

Image captions:

  1. A green blue and yellow fabric with red tape on top
  2. A pair of blue pajamas with yellow stars and a red tag on them.
  3. appears to be a picture of a tag
  4. Green blanket with stars that has a XXL tag on it
  5. image shows a size tag of a blue cloth's.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:3 / 5 annotators

Image 50: VizWiz_train_00019595.jpg

A photo captured by a person who is blind. A computer-generated caption for this photo is: A close up of a blender on a table.

Visual question: What is that?

Answers:

  1. coffee mug
  2. coffee to go mug
  3. mug
  4. coffee cup
  5. travel mug
  6. cup
  7. travel mug
  8. travel mug
  9. black mug
  10. black cup

Reasons why answers differ:

Image captions:

  1. A black plastic spill-free coffee mug on a wooden counter.
  2. a black travel coffee mug on a wooden desk next to a mousepad and a pen
  3. A large black plastic travel coffee mug with a handle sits on a wooden desk.
  4. a plastic black travel mug sits on a table
  5. Pictured is a black travel coffee mug with handle and lid.

Skills needed by an AI system to automatically answer the visual question:

Quality issues in the image:

Text detected by:0 / 5 annotators

Showing images 0 - 0 out of 0 matching images.