Social Media and Speech Recognition
CUSTOM SOLUTIONS FOR SOCIAL MEDIA
We Take a Backseat to Your Product
For all intents and purposes, your customers will never know we exist.
When Out-of-the-Box Speech Recognition Does Not Work
Additionally, although free, Siri and Google speech recognition solutions have very narrow applications and are, in most cases, tied to the vendor’s own specific software applications and they are not customizable or usable by outside vendors for integration into their own software solutions.
As an example, these products:
✔ Are very narrow in their application solutions.
✔ They are not open for customization or integration into your specific applications.
✔ No specific custom acoustic or language models can be developed by the supplier.
✔ No source code is available and no specific application interfaces (APIs) are available for developers to integrate such products.
✔ There is no way to modify the acoustic models and language models for languages not offered.
✔ They do not offer any trainable artificial intelligence or dialogue management capabilities.
✔ They are also cost prohibitive if you need to bundle the stand alone app with your product.
Let MatrixHCI consult with you to determine the best approach for meeting your specific speech recognition needs. We can design a speech solution that will custom fit your desired application requirements in any language and in any acoustic environment.
Client / Server Design
Because the core of our transcription system is based on a client server architecture, the actual speech recognition engine can be housed in the Cloud where high powered MatrixHCI designed servers can process the audio and return transcribed results to your client software. This is important when you need the higher horsepower that cannot be found in smaller handheld devices or slower computers to handle the high volume of speech recognition processing.
In this case, a small thin client software component can exist on your customer client (on a smart phone, handheld device, or PC) which packages the audio collected and takes care of all management to send the audio to the server and return the results … all the time using the higher horsepower servers to do the “heavy lifting” needed for high volume real-time speech recognition.
This client can be integrated behind the scene in the back end of your client software and it can handle all the complexities of sending and receiving the audio to the server. This architectural design is much the same as that which is employed by Siri and Google where the actual speech recognition does not reside on the consumer device.
The main difference between us and Siri and Goggle is that our technology is totally customizable and is not controlled or limited to specific devices and their own applications. MatrixHCI will design the architecture using your unique needs and interface requirements and sit within your application thus speech enabling your application in a way that meets your needs.
Additionally, when the devices your applications run on have the horsepower and hardware real estate to handle the speech recognition “on device,” the MatrixHCI architectural design allows the server to actually reside on the device itself thus eliminating the need for an internet connection. This means that you will never get the “Siri Not Available. No Internet Connection” message on our custom designed product.
MatrixHCI offers Custom Software Development with an emphasis on Advanced Cutting-Edge Speech Recognition
If you have a need for specialized custom speech recognition solutions, please contact us for a free confidential consultation.