-
Notifications
You must be signed in to change notification settings - Fork 0
/
niki.html
239 lines (213 loc) · 25.6 KB
/
niki.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
<!DOCTYPE html>
<html>
<head>
<!-- Meta -->
<meta charset="utf-8" />
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1" />
<meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1.0">
<meta name="description" content="Making Voices Heard | A study by the Centre for Internet and Society, India, supported by Mozilla Corporation" />
<!-- Title + CSS + Favicon -->
<title>Making Voices Heard</title>
<link rel="stylesheet" type="text/css" href="css/semantic.min.css">
<link rel="stylesheet" type="text/css" href="css/style.css">
<link rel="shortcut icon" type="image/x-icon" href="img/favicon.ico" />
<!-- Font Awesome -->
<script src="https://kit.fontawesome.com/4c415b9185.js" crossorigin="anonymous"></script>
</head>
<body>
<!-- Header -->
<div>
<div class="ui fluid container banner">
<div class="banner-image" aria-label="Cats are shown as people using various devices including voice interfaces in shops and houses, with a central banner that shows the title ‘Making Voices Heard’."></div>
</div>
</div>
<!-- Top Navigation Bar -->
<div class="blue nav">
<div class="ui container">
<div class="nav-entries">
<a href="index.html">Home</a>     <a href="design-brief.html">Design Brief</a> <a href="policy-brief.html">Policy Brief</a> <a href="mapping-actors.html">Mapping Actors</a> <a href="index.html#case-studies">Case Studies</a> <a href="index.html#literature-surveys">Literature Surveys</a> <a href="index.html#resources">Resources</a> <span id="report"><a href="docs/MakingVoicesHeard_FullReport.pdf"><i class="fas fa-arrow-circle-down"></i> Get Full Report</a></span>
</div>
</div>
</div>
<!-- Title -->
<div class="grey">
<div class="ui container four column stackable grid">
<div class="one wide column empty">
</div>
<div class="fourteen wide column text">
<h2>Niki</h2>
</div>
<div class="one wide column empty">
</div>
<div class="one wide column empty">
</div>
<div class="nine wide column text">
<img src="img/CaseStudy_Niki.jpg"width="100%" style="margin: 15px 0 1px 0;" alt="Cat shown as an older man speaking into a mobile phone in order to make a bill payment."/>
<h3 id="about">About</h3>
<p><strong>“Even before you do anything Niki starts to talk to you.”<sup class="superscript"><a href="#fn1">1</a></sup><a name="ref1"></a> </strong></p>
<p> Several start-ups in India work at the intersection of voice and multilingual support. Niki is one of the few that directly provides this support to the individual. It is described by its developers as an “artificial intelligence powered conversational commerce startup”.<sup class="superscript"><a href="#fn2">2</a></sup><a name="ref2"></a> This means that a person can use the app to make an array of digital payments with either text or voice commands.<sup class="superscript"><a href="#fn3">3</a></sup><a name="ref3"></a> Niki was founded in 2015 as a voice-based assistant app in English; now, it provides information in multiple Indian languages via voice and text. The app allows people to use voice commands to complete a range of tasks, including paying utility bills, recharging prepaid mobile accounts, booking tickets for travel and accommodation, and availing local deals. The team at Niki focuses on the segment they call ‘middle India’, which includes customers from Tier 2, Tier 3, and Tier 4 cities; their aim is to bring the benefits of the online economy, without the barriers of language, to these new internet users. In 2020, Niki witnessed an increase of 1,000% in revenue, with a 22% increase in their user base to 550,000 users.<sup class="superscript"><a href="#fn4">4</a></sup><a name="ref4"></a> The company has received funding from Ratan Tata, Unilazer Ventures, and SAP.iO, among others.<sup class="superscript"><a href="#fn5">5</a></sup><a name="ref5"></a>
As per a 2020 report, Niki plans to raise USD 50 million (more than 300 crore) by early 2021. The team intends to use these funds to expand their market share and capture 20% of the 150 million ‘Bharat household’ (Indian household) market by the financial year 2022.<sup class="superscript"><a href="#fn6">6</a></sup><a name="ref6"></a></p>
<h3 id="methodology">Methodology and process</h3>
<p>This section enumerates the design and user research process that the team at Niki follows to understand the needs of their users and find new ways to help them.</p>
<h4>Design process and user research</h4>
<p> The process of creating voice datasets in Indian languages involved several steps, beginning with the selection of languages that are the focus of the project and then building speech technologies using the voice datasets. The selection of the 13 languages was based on the following criteria: optimal text selection, speaker selection, pronunciation variation, recording specification, text correction for handling out-of-the-vocabulary words, and data verification.<sup class="superscript"><a href="#fn7">7</a></sup><a name="ref7"></a> To ensure the quality of data, characteristics that affect speech synthesis quality such as encoding (converting one form of data to another), sampling rate (number of samples of audio recorded every second) etc. were considered. The sentences for the speech recordings were taken through web crawlers from newspaper reports, Wikipedia pages, websites, and blogs in the respective Indian language. To achieve good coverage of topics and words, sentences were also taken from different types of literature, including children’s stories, science writing, tourism content, etc. Care was also taken to ensure that the texts were commonly used, free of errors, easy to read, and covered a wide range of words and syllables. Code-mixed sentences were avoided.</p>
<h4>Design process and user research</h4>
<p> According to the team at Niki, between 2016-2019, they have spent about 30,000 hours speaking to users. Research is a critical component of their design process. They have an in-house customer insights and research team that works towards understanding users better through various methods including on-ground research. For instance, the team of researchers, designers, and product managers travelled to Tier 3 (and below) cities such as Chomu, Pushkar, Ajmer, and Udaipur in Rajasthan to meet and undertake usability studies with residents. Usability studies are conducted to observe and understand the needs of a group of representative users.<sup class="superscript"><a href="#fn7">7</a></sup><a name="ref7"></a>They involve observing users as they attempt to complete tasks using the product.</p>
<p>The following are insights based on the key learnings from the user studies and the feedback collected in Tier 3 and Tier 4 cities:</p>
<p> Single-page and consistently guided user interface: The entire app should have a single-page user interface (UI) across services to maintain consistency. The user journey should be guided – the app should point users to the next step. Messages need to be crisp and to the point, and actionable items and messages should not be mixed. The content flows should be designed to be linear; multiple branches should be avoided.</p>
<p> Acknowledgement messages: The research showed that it is important to acknowledge every action of the individual, as it gives them the confidence to use the app. All the ‘call to action’ and actionable elements must be consolidated in a specifically identified area.</p>
<p> Apprehension about change: The research revealed that users were hesitant about trying new features for fear of failing. The team observed that they were diffident and nervous about trying anything they hadn’t previously encountered.</p>
<h3 id="languages">Multilingual and voice-based technology</h3>
<p>The founders of Niki aim to solve two structural problems: voice and vernacular. While creating the natural language processing (NLP)<sup class="superscript"><a href="#fn8">8</a></sup><a name="ref8"></a> engine for Indian languages, the team realised that sentences in most Indian languages had some words in English, creating challenges when building an interface that could understand a sentence where more than one language was used. To deal with this, they built an in-house NLP engine, with the idea that the engine could be scaled to different contexts (such as bill payment and phone recharge) and more Indian languages. To make it easier to add more languages that the system can understand, the team now only requires entries in the Natural Language Generation (NLG)<sup class="superscript"><a href="#fn9">9</a></sup><a name="ref9"></a> files. One of the other ways in which Niki was able to increase scale in adding new languages was by delaying heavy technological investments until the proof of concept was taken to users, and by making changes and upscaling based only on the feedback and success from the user research. The NLP system designed in-house provided Niki with the capacity to launch its services in different Indian languages with ease. There are three main components in the pipeline in terms of recording a response and giving an answer – a) transcribing the audio b) extracting meaning from the text, and c) responding to the individual. In the case of Niki, the first layer is provided by Google, the second layer by the NLP engine, and the third layer by dialogue management.</p>
<h4>Designing for hyper-localisation of conversations </h4>
<p> Niki claims to have a scalable design that can be adapted to multiple regional languages. When expanding to a new language, the team begins by understanding colloquial usage in specific regions and for particular uses. Based on their findings, they gauge users’ intentions, and design the app’s responses accordingly. The key challenge was to design for the hyper-localisation of conversations. In India, the immense diversity of languages and dialects is difficult to capture. The team sees this as a concern to be tackled in the future.</p>
<h3 id="privacy">Privacy and data collection</h3>
<p>One of the main challenges faced by companies and researchers working on Indian language voice interfaces (VIs) is the lack of data in several Indian languages. Niki aims to tackle this by collecting data and strengthening their NLP engine, including as many languages as possible. One of the ways in which Niki has been striving to ensure accuracy in different languages is through the annotation of data, as they always had voice as an input for data. Based on how accurate the model is in understanding a particular language, it is made available to the user. Their NLP and machine learning operations are trained to understand what the user is saying across multiple languages. They store inputs from each interaction made by the individual using the app to improve their models for various accents and dialects. This system allows them to have data with different languages, accents, volumes, pitches, word speeds, and background noises.<sup class="superscript"><a href="#fn10">10</a></sup><a name="ref10"></a></p>
<p> Niki is one of the few apps in India that provides a privacy policy in English and Hindi. The team also ensures that the financial data they collect is encrypted from the start to when the data is transferred. They also do not share personally identifiable information (PII) with third parties without the consent of the user. They practice purpose limitation internally and allow access to only the data that is required for a team to work on specific tasks.</p>
<h4>Building for Bharat</h4>
<p>In the initial years, Niki focused on creating voice-and speech-based interfaces that catered to English-speaking Indian people. During this time, they also provided voice-related services to various banks to use as their personalised voice bots. On listening to the interactions between people and these bots, the team realised that people were trying to speak to the English-only bot in their first languages. Only when they developed and launched the same chatbot in Hindi did they realise the immense potential that an Indian language–speaking chatbot had, especially in Tier 2 and Tier 3 cities in India. </p>
<p> During their extensive usability studies, they examined the reasons people buy smartphones and their device preferences. People preferred certain phones because they had big screens, their peers had them, or they wanted to watch videos on them. This is reinforced by other research that stated that people speaking Indian languages rely heavily on voice searches on non-transactional platforms like YouTube and Google.<sup class="superscript"><a href="#fn11">11</a></sup><a name="ref11"></a>
However, there seemed to be hesitation to use a voice-only input platform when money was involved, this could be due to the fear that money could be transferred unintentionally. In these cases, people preferred to ask for assistance from family members or friends to complete the transaction. During the user research, the team found that the main reason for this was the fear of losing money online. Hence, the app and the voice interface were designed to ensure user trust and create an experience similar to interacting with a family member.</p>
<h3 id="access">Accessibility and assistance</h3>
<p>Niki equates the issue of accessibility with removal of unfamiliarity, since they believe that unfamiliarity is a barrier to using new technologies or apps. The interface is made accessible with the provision of support in multiple Indian languages in a way that makes the individual comfortable; it offers a type of hand-holding while people engage with new functionalities. The interface also includes crisp replies and acknowledgement messages. Niki believes that this could lead to an improved form of accessibility – VIs in multiple Indian languages will open the “internet economy to new internet users''.<sup class="superscript"><a href="#fn12">12</a></sup><a name="ref12"></a> This includes enabling an individual to read or interact in their own language and be comfortable using the internet for more services, including some that involve monetary transactions. Niki aims to act as a guide through the individual’s journey through the app by making them comfortable, initiating conversation in their languages, and confirming each utterance to ensure an accurate record of queries.</p>
<h3 id="challenges">Challenges</h3>
<p> “The way Hindi is spoken in Bihar is different from the way Hindi is spoken in Rajasthan.”<sup class="superscript"><a href="#fn13">13</a></sup><a name="ref13"></a></p>
<p>A significant challenge for Niki and most voice interface developers in India is the huge diversity of languages that are spoken in the country. This heterogeneity of languages even among dialects creates the pressure to develop interfaces that understand languages as well as dialects. Most Indian languages are still low-resource ones without enough data, which acts as a barrier to creating VIs in these languages. Consequently the lack of less widely spoken languages available in VIs prevents them from being adopted by a number of communities.</p>
<h3 id="future">Future of Niki</h3>
<p> “Niki’s vision is that nobody’s left behind.”<sup class="superscript"><a href="#fn14">14</a></sup><a name="ref14"></a></p>
<p> Niki aims to use experience from creating local language-and-voice-based interfaces, especially for ‘middle India’ households in smaller cities, to expand their reach in the future. They aim to make the entire internet economy accessible to a large number of Indians who are coming online for the first time. Their current focus is to provide access to essential services such as rations, electricity, and phone recharging. Their focus has always been to improve the socio-economic lives of ‘middle India’ households and to further this they hope to reach more people through a voice interface that they can use in their own language.</p>
<br />
<i> Disclaimer: This is an independent case study conducted as a part of the Making Voices Heard Project, supported by the Mozilla Corporation. The researchers have not received any external remuneration as a part of this case study, and claim no conflict of interest.</i>
</div>
<div class="one wide column empty">
</div>
<div class="five wide column meta">
<p><span id="grey">Research and Writing by</span> <br />Shweta Mohandas <span id="grey">and</span> Saumyaa Naidu
<br />
<span id="grey">Review and Editing by</span> <br />Puthiya Purayil Sneha <span id="grey">and</span> Torsha Sarkar<br />
<span id="grey">Research Inputs by</span> <br />Sumandro Chattapadhyay<br />
<br />
<a href="docs/MozVoice_CaseStudies_Niki_02.pdf"><i class="fas fa-arrow-circle-down" style="color: black;" ></i> Download Niki Case study</a></p>
<br />
<hr />
<br />
<p><span style="line-height: 3em;">CONTENTS</span></p>
<p><a href="#about"><strong>About</strong></a></p>
<p><a href="#methodology"><strong>Methodology and Process</strong></a></p>
<p><a href="#languages"><strong>Multilingual and voice-based technology</strong></a></p>
<p><a href="#privacy"><strong>Privacy and data collection</strong></a></p>
<p><a href="#access"><strong>Accessibility and assistance</strong></a></p>
<p><a href="#challenges"><strong>Challenges</strong></a></p>
<p><a href="#future"><strong>Future of Niki</strong></a></p>
</div>
<div class="one wide column empty">
</div>
<div class="nine wide column text">
<div class="ten wide column content">
</div>
<div class="ten wide column content">
<br />
<h3>Notes</h3>
<table class="footnote">
<tr>
<td class="number">1</td>
<td class="reference"><a name="fn1"></a>Interview, Niki, online, Bangalore, 24 July 2020 <span class="internal-nav"><a href="#ref1">↑</a></span></td>
</tr>
<tr>
<td class="number">2</td>
<td class="reference"><a name="fn2"></a> Sangani, P., “Hindi Users Help Niki.ai Grow 300x”, The Economic Times, 5 June 2019, Accessed 05 September 2021, <a href="https://economictimes.indiatimes.com/small-biz/startups/newsbuzz/hindi-users-help-niki-ai-grow-300x/articleshow/69940200.cms"target="_blank"> https://economictimes.indiatimes.com/small-biz/startups/newsbuzz/hindi-users-help-niki-ai-grow-300x/articleshow/69940200.cms/</a> <span class="internal-nav"><a href="#ref2">↑</a></span></td>
</tr>
<tr>
<td class="number">3</td>
<td class="reference"><a name="fn3"></a>Sangani, “Hindi Users”,<em> The Economic Times.</em> <span class="internal-nav"><a href="#ref3">↑</a></span></td>
</tr>
<tr>
<td class="number">4</td>
<td class="reference"><a name="fn4"></a> Kashyaap, S., “[Product Roadmap] How Ratan Tata-backed Niki.ai is Helping Bharat Users Perform Transactions”, YourStory, 8 April 2021, <a href="https://yourstory.com/2020/08/product-roadmap-ratan-tata-nikiai-bharat-conversational-ai-startup/amp"target="_blank">https://yourstory.com/2020/08/product-roadmap-ratan-tata-nikiai-bharat-conversational-ai-startup/amp</a> <span class="internal-nav"><a href="#ref4">↑</a></span></td>
</tr>
<tr>
<td class="number">5</td>
<td class="reference"><a name="fn5"></a>Sangani, “Hindi Users”,<em> The Economic Times.</em> <span class="internal-nav"><a href="#ref5">↑</a></span></td>
</tr>
<tr>
<td class="number">6</td>
<td class="reference"><a name="fn6"></a> HT Brand Studio, “Niki Unlocks Bharat’s Internet Economy Averaging 52 Transactions per Household”, <em>Mint</em>, 12 January 2021, <a href="https://www.livemint.com/brand-post/niki-unlocks-bharat-s-internet-economy-averaging-52-transactions-per-household-11610451780112.html" target="_blank"> https://www.livemint.com/brand-post/niki-unlocks-bharat-s-internet-economy-averaging-52-transactions-per-household-11610451780112.html.</a> <span class="internal-nav"><a href="#ref6">↑</a></span></td>
</tr>
<tr>
<td class="number">7</td>
<td class="reference"><a name="fn7"></a> “Usability Testing”, <em>Interaction Design</em>, accessed 3 November 2021, <a href="https://www.interaction-design.org/literature/topics/usability-testing"target="_blank"> https://www.interaction-design.org/literature/topics/usability-testing</a> <span class="internal-nav"><a href="#ref7">↑</a></span></td>
</tr>
<tr>
<td class="number">8</td>
<td class="reference"><a name="fn8"></a>NLP is the automatic manipulation of natural language, like speech and text, by software. <span class="internal-nav"><a href="#ref8">↑</a></span></td>
</tr>
<tr>
<td class="number">9</td>
<td class="reference"><a name="fn9"></a>The software process that produces a natural language output. The NGL process converts machine code into human language output. <span class="internal-nav"><a href="#ref9">↑</a></span></td>
</tr>
<tr>
<td class="number">10</td>
<td class="reference"><a name="fn10"></a> hippoBrain “E41: Full-stack Ramu Kaka - A Product to Service the Bharat Market by Sachin Jaiswal & Shishir Modi”, <em>YouTube</em> 2 April 2021, <a href="https://www.youtube.com/watch?v=bn4h9jsmfcg&ab_channel=hippoBrainhippoBrain."target="_blank"> https://www.youtube.com/watch?v=bn4h9jsmfcg&ab_channel=hippoBrainhippoBrain.</a> <span class="internal-nav"><a href="#ref10">↑</a></span></td>
</tr>
<tr>
<td class="number">11</td>
<td class="reference"><a name="fn11"></a>Interview, Niki, online, Bangalore, 24 July 2020 <span class="internal-nav"><a href="#ref11">↑</a></span></td>
</tr>
<tr>
<td class="number">12</td>
<td class="reference"><a name="fn12"></a>Interview, Niki, online, Bangalore, 24 July 2020 <span class="internal-nav"><a href="#ref12">↑</a></span></td>
</tr>
<tr>
<td class="number">13</td>
<td class="reference"><a name="fn13"></a>Interview, Niki, online, Bangalore, 24 July 2020 <span class="internal-nav"><a href="#ref13">↑</a></span></td>
</tr>
<tr>
<td class="number">14</td>
<td class="reference"><a name="fn14"></a>Interview, Niki, online, Bangalore, 24 July 2020 <span class="internal-nav"><a href="#ref14">↑</a></span></td>
</tr>
</table>
</div>
</div>
<div class="six wide column empty">
</div>
</div>
</div>
</div>
<!-- Footer -->
<div class="footer">
<div class="ui container four column stackable grid">
<div class="one wide column empty">
</div>
<div class="five wide column">
<h3>About the Study</h3>
<p>We believe that voice interfaces have the potential to democratise the use of the internet by addressing limitations related to reading and writing on digital text-only platforms and devices. This report examines the current landscape of voice interfaces in India, with a focus on concerns related to privacy and data protection, linguistic barriers, and accessibility for persons with disabilities (PwDs). This project was undertaken with support by the Mozilla Corporation.</p>
</div>
<div class="five wide column">
<h3>Research Team</h3>
<p><em>Research</em> Shweta Mohandas, Saumyaa Naidu, Deepika Nandagudi Srinivasa, Divya Pinheiro, Sweta Bisht</p>
<p><em>Conceptualisation, Planning, and Research Inputs</em> Sumandro Chattapadhyay, Puthiya Purayil Sneha</p>
<p><em>Illustration</em> Kruthika NS (Instagram @theworkplacedoodler)</p>
<p><em>Website Design</em> Saumyaa Naidu</p>
<p><em>Website Development</em> Sumandro Chattapadhyay, Pranav M Bidare</p>
<p><em>Review and Editing</em> Puthiya Purayil Sneha, Divyank Katira, Pranav M Bidare, Torsha Sarkar, Pallavi Bedi, Divya Pinheiro</p>
<p><em>Copy Editing</em> The Clean Copy</p>
</div>
<div class="four wide column">
<h3>Copyright and Credits</h3>
<p>Copyright: <a href="http://cis-india.org/" target="_blank">CIS, India</a>, 2021<br />License: <a href="https://creativecommons.org/licenses/by/4.0/" target="_blank">CC BY 4.0 International</a></p>
<p>Built using <a href="https://semantic-ui.com/" target="_blank">Semantic UI</a><br/><a href="https://fonts.google.com/specimen/Barlow" target="_blank">Barlow</a> and <a href="https://fonts.google.com/specimen/Open+Sans" target="_blank">Open Sans</a> by <a href="https://fonts.google.com/" target="_blank">Google Fonts</a><br/>Social media icons by <a href="https://fontawesome.com/" target="_blank">Font Awesome</a><br/>Hosted on <a href="https://github.com/cis-india/mozvoice" target="_blank">GitHub</a></p>
</div>
<div class="one wide column empty">
</div>
<div class="sixteen wide column">
<div style="float: center; clear: both;">
<a href="https://cis-india.org/" target="_blank" style="border-bottom: 0px solid"><img src="img/logo.png" alt="The Centre for Internet and Society, India" class="logo" /></a>
</div>
<div class="icons" style="float: center; clear: both;">
<a href="https://www.instagram.com/cis.india/" target="_blank"><i class="fab fa-instagram fa-lg"></i></a> <a href="https://twitter.com/cis_india" target="_blank"><i class="fab fa-twitter fa-lg"></i></a> <a href="https://www.youtube.com/channel/UC0SLNXQo9XQGUE7Enujr9Ng" target="_blank"><i class="fab fa-youtube fa-lg"></i></a></p>
</div>
</div>
</div>
</div>
</body>
</html>