AI Technology

Custom TTS

Innovative TTS (Text-to-Speech) that changes our lives


This technology converts text into natural-sounding speech and delivers it to users. Built on artificial intelligence and natural language processing, it supports a wide range of languages, voices, and tones, which opens up a broad range of use cases. Users get high-quality voice output from simple text input, improving information accessibility and the user experience.

ESTsoft leads innovative change in the content creation environment and provides business growth opportunities across various industries. In particular, ESTsoft holds its own AI Human technology, developed in Korea, which creates diverse characters that have not only a physical appearance but also a virtual identity, and it is commercializing this technology and offering it as a service.


What is STV technology?

STV modifies an input video so that the speaker's mouth shapes match a given voice input, and generates the result as an output video.
The process analyzes vocal characteristics such as pitch, intensity, and duration, and maps them to mouth shapes to create the video.
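
The idea of mapping voice characteristics to mouth shapes can be illustrated with a small sketch. The page does not describe how STV performs this mapping, so everything below is an assumption made for illustration: the function names `frame_rms` and `mouth_openness` are hypothetical, only intensity is used (pitch and duration are ignored), and a simple normalization stands in for what would be a learned model in a real system.

```python
import math

def frame_rms(samples, sample_rate, fps):
    """Return one RMS intensity value per video frame."""
    hop = sample_rate // fps  # audio samples that play during one video frame
    values = []
    for start in range(0, len(samples) - hop + 1, hop):
        frame = samples[start:start + hop]
        values.append(math.sqrt(sum(x * x for x in frame) / hop))
    return values

def mouth_openness(rms_values):
    """Scale intensities to [0, 1]; a toy stand-in for a learned mapping."""
    peak = max(rms_values, default=0.0)
    if peak == 0.0:
        return [0.0 for _ in rms_values]
    return [v / peak for v in rms_values]

# One second of audio: half a second of silence, then a 220 Hz tone.
sample_rate, fps = 8000, 25
silence = [0.0] * 4000
tone = [math.sin(2 * math.pi * 220 * n / sample_rate) for n in range(4000)]
openness = mouth_openness(frame_rms(silence + tone, sample_rate, fps))
```

Here `openness` holds one value per video frame: zero while the audio is silent and close to one while the tone plays. A production system would predict full mouth shapes rather than a single number.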



Preprocessing stage

Data refinement

Only suitable videos are selected: videos with no noise and clearly articulated speech.

Data Conversion

The data is converted into a format that the deep learning model can understand and process.
The audio is converted into a form that can be fed into the model, and the portion of the video in which the person appears is extracted.
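
As a rough illustration of the conversion step, the sketch below keeps only the stretches of video in which a person appears and slices the matching audio. It rests on assumptions that are not in the source: "the part where the person appears" is read as a time range rather than a spatial crop, the per-frame detection flags are taken to come from some face detector that is not shown, and `person_segments` and `align_audio` are hypothetical helper names.

```python
def person_segments(detected, min_len=1):
    """Return (start, end) frame ranges, end exclusive, where a person is detected."""
    segments = []
    start = None
    for i, flag in enumerate(detected):
        if flag and start is None:
            start = i  # a new run of detected frames begins
        elif not flag and start is not None:
            if i - start >= min_len:
                segments.append((start, i))
            start = None
    # Close a run that lasts until the final frame.
    if start is not None and len(detected) - start >= min_len:
        segments.append((start, len(detected)))
    return segments

def align_audio(samples, segment, sample_rate, fps):
    """Slice the audio samples that play during a video segment."""
    start, end = segment
    samples_per_frame = sample_rate // fps
    return samples[start * samples_per_frame:end * samples_per_frame]

# Eight video frames; the flags stand in for the output of a face detector.
detected = [False, True, True, True, False, False, True, True]
segments = person_segments(detected, min_len=2)

# Toy audio: 4 samples per video frame (100 Hz audio, 25 fps video).
audio = list(range(32))
clip = align_audio(audio, segments[0], sample_rate=100, fps=25)
```

Each resulting pair of video segment and audio clip covers the same time span, which is the kind of aligned input a lip-sync model would need.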


Deep learning training

The preprocessed data is fed into the model, which is trained by comparing its outputs with the correct answers.
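
Training by comparing outputs with correct answers is the standard supervised learning loop. The sketch below shows that loop on a deliberately tiny model, a single linear unit fitted with gradient descent on mean squared error. It is not ESTsoft's model or training code: the data, the `train` and `mse` functions, and the hyperparameters are all made up for illustration, and a real lip-sync model would be a deep network trained on video frames.

```python
def mse(w, b, inputs, targets):
    """Mean squared error between model outputs and the correct answers."""
    return sum((w * x + b - y) ** 2 for x, y in zip(inputs, targets)) / len(inputs)

def train(inputs, targets, lr=0.5, epochs=500):
    """Fit y = w * x + b by gradient descent on the mean squared error."""
    w, b = 0.0, 0.0
    n = len(inputs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(inputs, targets):
            error = (w * x + b) - y  # compare the output with the correct answer
            grad_w += 2 * error * x / n
            grad_b += 2 * error / n
        w -= lr * grad_w  # step against the gradient to reduce the error
        b -= lr * grad_b
    return w, b

# Toy data generated from y = 0.8 * x + 0.1 (hypothetical values).
inputs = [0.0, 0.25, 0.5, 0.75, 1.0]
targets = [0.8 * x + 0.1 for x in inputs]

loss_before = mse(0.0, 0.0, inputs, targets)
w, b = train(inputs, targets)
loss_after = mse(w, b, inputs, targets)
```

After training, `w` and `b` are close to the values used to generate the data, and `loss_after` is far smaller than `loss_before`.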



Diagram of face recognition technology, featuring face detection, landmark detection, segmentation, and face editing.

Strengths of Technology

The most significant feature of STV is that it generates lip movements that match the spoken words not only for the language and voice of the original video, but also for other languages and arbitrary voices. In other words, STV can accommodate a wide variety of languages and voice characteristics.


Utilization of technology

STV is opening up new creative possibilities through the combination of voice and video technology. The advancement of this technology is expected to make the future of digital media more interesting and diverse.


Global SaaS with AI
A scalable Human SaaS service that can be accessed from anywhere in the world using AI technology
Senior care with AI
Interactive AI human supports guidance, consultation, and interaction both offline and online. Expanding as a service hub without language barriers in retail, tourism, entertainment, exhibitions, manufacturing, and public sectors.
Alan Agentic with AI
A multi-agent AI that goes beyond AI search to arrive at solutions to problems
Education with AI
Expanding education businesses in various fields, such as celebrity instructor video lectures, TOEIC speaking education content, and AI content featuring a fitness training instructor
Content with AI
Implementing 'moving pictures' with EST AI technology, and 'face transformation, makeup application, and clothing creation' through deep learning
Creating and utilizing various AI human content, such as new employee analysts and announcers
API business with AI
We provide AI-based data and solutions through APIs so that companies can focus on their inherent customer value.
Software with AI
The smooth combination of ESTsoft AI technology and ALTools products, such as the background removal technology applied in ALSee Capture, provides the utility environment that users want.
