Audio signifiers

Research by design

TL;DR: Skimming a website is difficult for visually impaired users because they do not have access to visual cues. This concept works with screen readers to enable eyes-free skimming of websites.

Duration: 12 months

Image of the Manage Signifiers window, which lets users customize audio signifiers, and a set of tutorials that help new users learn to use audio signifiers.

Location: Capstone project funded by Mozilla Research Grant 2018
Role: Interaction Design, UX Research, Independent project

Methods: Research by design, Moderated remote usability tests
Tools: Sketch, Dreamweaver, JavaScript, Web Audio API, Trello

Current screen readers require visiting each item sequentially.

Thus, visually impaired users need a lot of time to get a high-level overview of web content, whereas sighted users rely on visual cues to skim it quickly. This gap in web experience makes it hard to explore many websites in a short time, which can be frustrating.

Key insights from interviews, contextual inquiry, and user evaluation.

Users are unaware of valuable content

Visually impaired users miss out on the usefulness of the website because valuable content is difficult to reach quickly.

Clueless about changes on a website

Many changes on websites are conveyed only visually, which makes them difficult to notice and interpret.

Difficulty finding the source of errors

Many users find it difficult to trace the source of validation errors while filling out forms.

Need for earmarking or personalization

The ability to add custom markers or flags lets users retrace web content and focus selectively on what matters to them.

Audio signifiers facilitate eyes-free skimming via screen readers.

This concept complements screen reader output without being obtrusive. Further, the sounds are inspired by real-world counterparts to add materiality to the web experience.

Audio signifiers are used to convey the current status of the website, to mark important or new content, to convey events, hierarchy, or placement, and to grab the user's attention when errors occur.
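As a rough illustration of how these four uses could be organized in code, here is a minimal sketch of a signifier lookup table with user overrides. Everything in it is an assumption made for illustration: the `Purpose` names, the `Signifier` fields, the sound names, and the `resolveSignifier` helper are hypothetical and are not taken from the actual prototypes.

```typescript
// Hypothetical purposes, one per use of audio signifiers described above.
type Purpose = "status" | "newContent" | "structure" | "error";

interface Signifier {
  sound: string;       // illustrative name of a real-world-inspired sound (assumption)
  interrupts: boolean; // whether it may play over screen reader speech (assumption)
}

// Illustrative defaults; the real project may use entirely different sounds.
const defaultSignifiers: Record<Purpose, Signifier> = {
  status: { sound: "door-chime", interrupts: false },
  newContent: { sound: "page-flip", interrupts: false },
  structure: { sound: "soft-tick", interrupts: false },
  error: { sound: "glass-tap", interrupts: true },
};

// Returns the user's custom signifier for a purpose if one exists,
// otherwise falls back to the default.
function resolveSignifier(
  purpose: Purpose,
  custom: Partial<Record<Purpose, Signifier>> = {}
): Signifier {
  return custom[purpose] ?? defaultSignifiers[purpose];
}

// Usage: a user replaces only the error sound and keeps the other defaults.
const userOverrides: Partial<Record<Purpose, Signifier>> = {
  error: { sound: "my-bell", interrupts: true },
};
const errorSignifier = resolveSignifier("error", userOverrides);
const statusSignifier = resolveSignifier("status", userOverrides);
```

Keeping defaults and overrides separate means a user can personalize one sound without having to redefine the rest.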

Discovering pain points through user interviews and contextual inquiry.

Current screen readers do not provide a quick overview of the website. Hence, visually impaired users have to sequentially tab through several UI elements before they can decide whether the website is useful. This requires a lot of time.

Image of a visually impaired interviewee using a computer.

“Sometimes, I do not even pay attention to the screen reader. I quickly go to the part which I need by tabbing.”

- Software Developer

"I do online shopping, read the news, or connect with my friends on social networking sites. But, I miss out on so much content until someone says something."

- Principal Accessibility Consultant

“Often, I am unaware of what changed on the website if I click on something. For example, am I already logged in?”

- Social Worker

"Learning screen readers was not easy. It took me a while to get used to it. So I played a lot of games!"

- Undergraduate Student

Looking from the users' perspective through archetypes or personas.

I realized that any design change should make learning screen readers easier for new users, but it should not hinder experienced users who have been using them for more than 20 years. Creating personas inspired by user stories helped to portray user intentions.

"Using JAWS is second nature to me. It has been there since I was 15. I know it is not perfect, but it works."

- Jennifer

"Getting used to a screen reader has been a challenge. I had to practice a lot to get this good!"

- Dillon

Brainstorming broad concepts to kickstart ideation.

I began by speculating about a future where the web experience is conveyed by audio instead of sight. Then I mapped several discernible properties of sound to replace visual cues. This led to acoustic experience ideas suited to six kinds of websites.

Concept 1 is a grid-wall-based acoustic experience for online shopping sites. Concept 2 is a ripple-based acoustic experience for social networking sites. Concept 3 is an ellipse acoustic experience for online streaming sites. Concept 4 is a fluid acoustic experience for storytelling or portfolios. Concept 5 is a layered acoustic experience for news, magazines, and galleries. Concept 6 is a concave acoustic experience for directories or contact pages.

Key learning: Several properties of sound, such as pitch, frequency, position, distance, stereo image, space (wet/dry reverb percentage, type of reflections), and envelope (Attack, Decay, Sustain, Release), have functional qualities. Moreover, cohesiveness and compositional properties have aesthetic qualities.
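To make two of these properties concrete, here is a minimal sketch of a linear ADSR envelope and a position-to-stereo mapping, written as pure functions so they run without a browser. The names `Adsr`, `envelopeGain`, and `panForPosition` are hypothetical and not from the prototypes. In a browser, values like these could drive a Web Audio `GainNode` gain and a `StereoPannerNode` pan, but that wiring is an assumption and is not shown here.

```typescript
// Envelope times are in seconds; sustain is a level between 0 and 1.
interface Adsr {
  attack: number;
  decay: number;
  sustain: number;
  release: number;
}

// Gain (0..1) of a linear ADSR envelope at time t, for a sound that
// starts at t = 0 and is released at t = noteOff.
function envelopeGain(env: Adsr, t: number, noteOff: number): number {
  if (t < 0) return 0;
  // Gain while the sound is still held (attack, decay, then sustain).
  const held = (time: number): number => {
    if (time < env.attack) return time / env.attack;
    if (time < env.attack + env.decay) {
      const progress = (time - env.attack) / env.decay;
      return 1 + (env.sustain - 1) * progress;
    }
    return env.sustain;
  };
  if (t <= noteOff) return held(t);
  // Release: fade linearly from the level at noteOff down to silence.
  const progress = (t - noteOff) / env.release;
  return progress >= 1 ? 0 : held(noteOff) * (1 - progress);
}

// Maps a horizontal position on the page to a stereo pan value,
// where -1 is fully left and 1 is fully right.
function panForPosition(x: number, pageWidth: number): number {
  const pan = (2 * x) / pageWidth - 1;
  return Math.max(-1, Math.min(1, pan));
}

// Usage: a short, percussive envelope for an attention-grabbing sound.
const alertEnvelope: Adsr = { attack: 0.1, decay: 0.1, sustain: 0.5, release: 0.2 };
const gainWhileSustained = envelopeGain(alertEnvelope, 0.3, 0.5);
const panForCenteredItem = panForPosition(50, 100);
```

A short attack makes a sound feel abrupt and urgent, while a longer attack and release make it feel softer, which is one way an envelope can carry functional meaning.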

Testing the prototypes with participants

After building the prototypes, I conducted moderated remote evaluations with 10 participants. First, the participants were asked to familiarize themselves with the sounds and their purpose. Then they interacted with the prototypes, which imitated the problematic situations. Finally, participants were asked to describe their experience by filling out a Google Form. Let's look at the key findings.


Image from a usability study conducted over a Zoom video conference.

70% of participants confirmed that audio signifiers enable skimming of web content. Participants took an average of 18 minutes and five attempts to learn the sounds and the events they represent. When participants chose custom sounds, the average learning time was less than 1 minute. Participants could consistently remember 4 out of 7 sounds at any given time. 6 out of 10 participants felt that audio signifiers aid in learning screen readers.

Tutorials and customizing sounds promote user acceptance.

User evaluations demonstrated the usefulness of audio signifiers in specific scenarios. They also revealed that users needed tutorials to learn the various ways to use audio signifiers, and that users wanted customization. Hence, I created a workflow to enable customization and learning through tutorials.

Image of the workflow for audio signifier settings where users can turn on/off sounds, play tutorials, and add new audio signifiers.

Reflections

I felt the pressure of meeting deadlines because the project was funded by the Mozilla Research Grant. My ability to handle pressure and do whatever it takes improved greatly!

Recruiting participants was tricky. Hence, I had to think outside the box and publish podcast-style audio to get people interested.

The next steps in this project are conducting remote moderated evaluation studies for Phase 3 and writing a deadline-ready research paper.

Caricature of a sighted user who relies on vision to interact with the web, and a visually impaired user who relies on hearing.