Yandex SpeechKit Integration for Speech Recognition

We design and deploy artificial intelligence systems: from prototype to production-ready solutions. Our team combines expertise in machine learning, data engineering and MLOps to make AI work not in the lab, but in real business.
Showing 1 of 1 servicesAll 1566 services
Yandex SpeechKit Integration for Speech Recognition
Simple
from 1 business day to 3 business days
FAQ
AI Development Areas
AI Solution Development Stages
Latest works
  • image_website-b2b-advance_0.png
    B2B ADVANCE company website development
    1215
  • image_web-applications_feedme_466_0.webp
    Development of a web application for FEEDME
    1161
  • image_websites_belfingroup_462_0.webp
    Website development for BELFINGROUP
    852
  • image_ecommerce_furnoro_435_0.webp
    Development of an online store for the company FURNORO
    1041
  • image_logo-advance_0.png
    B2B Advance company logo design
    561
  • image_crm_enviok_479_0.webp
    Development of a web application for Enviok
    823

Yandex Speech

Kit integration for speech recognition Yandex SpeechKit is a Russian STT service optimized for the Russian language, Russian names, and realities. Infrastructure in the Russian Federation - data does not leave the country. WER for Russian spoken audio: 10-15%, for clear speech: 5-8%. ### Advantages for Russian-language tasks - Better quality in Russian compared to Whisper in telephony conditions - Recognition of names and toponyms of the Russian Federation without additional configuration - FSTEC compliance with on-premise deployment (SpeechKit Enterprise) - Integration with the Yandex Cloud ecosystem: Object Storage, API Gateway ### Streaming recognition via gRPC```python import grpc from yandex.cloud.ai.stt.v3 import stt_pb2, stt_pb2_grpc, stt_service_pb2

channel = grpc.secure_channel('stt.api.cloud.yandex.net:443', grpc.ssl_channel_credentials()) stub = stt_pb2_grpc.RecognizerStub(channel)

recognize_options = stt_pb2.StreamingOptions( recognition_model=stt_pb2.RecognitionModelOptions( audio_format=stt_pb2.AudioFormatOptions( raw_audio=stt_pb2.RawAudio( audio_encoding=stt_pb2.RawAudio.LINEAR16_PCM, sample_rate_hertz=16000, audio_channel_count=1 ) ), language_restriction=stt_pb2.LanguageRestrictionOptions( restriction_type=stt_pb2.LanguageRestrictionOptions.WHITELIST, language_code=['ru-RU'] ), text_normalization=stt_pb2.TextNormalizationOptions( text_normalization=stt_pb2.TextNormalizationOptions.TEXT_NORMALIZATION_ENABLED, profanity_filter=False, literature_text=True ) ) )