声音识别SDK

  • VeriSpeak SDK

 

VERISPEAK SDK
SPEAKER RECOGNITION FOR STAND-ALONE OR WEB APPLICATIONS
VeriSpeak voice identification technology is designed for biometric system developers and integrators. The text-dependent speaker recognition algorithm assures system security by checking both voice and phrase authenticity. Voiceprint templates can be matched in 1-to-1 (verification) and 1-to-many (identification) modes.
Available as a software development kit that enables the development of stand-alone and Web-based speaker recognition applications on Microsoft Windows, Linux, Mac OS X, iOS and Android platforms.

VERISPEAK SDK
单机系统或WEB应用程序中的语音识别技术
VeriSpeak语音识别技术是为生物识别系统开发人员和集成商设计的。文本相关的语音识别算法通过检查语音和短语的真实性来保证系统的安全性。声纹模板可以在1:1(验证)和1 t:N(识别)模式下进行快速准确匹配.
作为一种软件开发工具包,可以在Microsoft Windows、Linux、Mac OS X、iOS和Android平台上开发单机和基于Web的语音识别应用程序。。

 

FEATURES AND CAPABILITIES

  • Text-dependent algorithm prevents unauthorized access with a covertly-recorded user voice.
  • Two-factor authentication by checking voice biometrics and pass-phrase authenticity.
  • Regular microphones and smartphones are suitable for recording user voices.
  • Available as a multiplatform SDK that supports multiple programming languages.
  • Reasonable prices, flexible licensing and free customer support.

产品特点和功能

  • 与文本相关的算法防止使用秘密录制的用户语音进行未经授权的访问。
  • 双因素认证,检查语音生物特征和通过短语的真实性。
  • 可使用普通的麦克风和智能手机记录用户的声音。
  • 支持多种编程语言、跨平台运行。
  • 合理的价格,灵活的许可和免费的客户支持。

The VeriSpeak algorithm implements voice enrollment and voiceprint matching using proprietary sound processing technologies:

  • Text-dependent algorithm. The text-dependent speaker recognition is based on saying the same phrase for enrollment and verification. The VeriSpeak algorithm determines if a voice sample matches the template that was extracted from a specific phrase. During enrollment, one or more phrases are requested from the person being enrolled. Later that person may be asked to pronounce a specific phrase for verification. This method assures protection against the use of a covertly recorded random phrase from that person.
  • Two-factor authentication with a passphrase. The VeriSpeak voiceprint-matching algorithm can be configured to work in a scenario where each user records a unique phrase (such as passphrase or an answer to a "secret question" that is known only by the person being enrolled). Later a person is recognized by his or her own specific phrase with a high degree of accuracy. The overall system security increases as both voice authenticity and passphrase are checked.
  • Text-independent algorithm. The phrase-independent speaker recognition uses different phrases for user enrollment and recognition. This method is more convenient, as it does not require each user to remember the passphrase. It may be combined with the text-dependent algorithm to perform faster text-independent search with further phrase verification using the more reliable text-dependent algorithm.
  • Automatic voice activity detection. VeriSpeak is able to detect when users start and finish speaking.
  • Liveness detection. A system may request each user to enroll a set of unique phrases. Later the user will be requested to say a specific phrase from the enrolled set. This way the system can ensure that a live person is being verified (as opposed to an impostor who uses a voice recording).
  • Identification capability. VeriSpeak functions can be used in 1-to-1 matching (verification) and 1-to-many (identification) modes.
  • Multiple samples of the same phrase. A template may store several voice records with the same phrase to improve recognition reliability. Certain natural voice variations (i.e. hoarse voice) or environment changes (i.e. office and outdoors) can be stored in the same template.
  • Fused matching. A system may ask users to pronounce several specific phrases during speaker verification or identification and match each audio sample against records in the database. The VeriSpeak algorithm can fuse the matching results for each phrase together to improve matching reliability.

VeriSpeak算法使用专有的声音处理技术实现语音登录和声纹匹配:

  • 文本相关算法文本相关的语音识别是基于相同的词组用于注册和验证。VeriSpeak算法确定语音样本是否与从特定短语中提取的模板匹配。在注册期间,将向被注册的人员请求一个或多个短语。稍后,该人可能被要求一个特定的短语进行核实。这种方法确保了不被对使用者秘密录音的随机短语攻击。
  • 使用密码的双因素身份验证 可以将VeriSpeak声纹匹配算法设置为每个用户记录一个独特的、只为该人所知的短语(如密码或“秘密问题”的答案)进行登记。然后系统使用他或她自己的特定短语进行识别,这方法具有很高的准确性,因为它相当于同时检测语音真实性和核对了密码,所以整个系统的安全性大幅提高。
  • 文本无关算法。短语独立的说话人识别使用不同的短语进行用户注册和识别。这种方法更方便,因为它不需要每个用户记住密码。它可以与文本相关算法结合使用更可靠的文本相关算法来执行更快的文本无关搜索和进一步的短语验证。
  • 自动语音活动检测。VeriSpeak能够检测用户何时开始和结束说话。
  • 活体检测 系统可以请求每个用户注册一组独特的短语。稍后,用户将被要求从登记的集合中说出一个特定的短语。通过这种方式,系统可以确保真实的验证者,而不是一个使用语音录音的冒名顶替者。
  • 识别能力 VeriSpeak函数可用于1:1匹配(验证)和1N(识别)模式。
  • 同一短语的多个样本 模板可以用相同的短语存储多个语音记录,以提高识别的可靠性。某些自然声音变化(如声音嘶哑)或环境变化(如室内和室外)可以存储在同一模板中。
  • 融合匹配 系统可以要求用户在语音验证或识别过程中说出几个特定短语,并将每个音频样本与数据库中的记录进行匹配。VeriSpeak算法可以将每个阶段的匹配结果融合在一起,从而提高匹配的可靠性。