AI Technology

Custom TTS

Custom TTS

Innovative TTS (Text-to-Speech) that changes our lives

Our lives are changing with innovative TTS
(Text-to-Speech)

Text information is converted to natural sounding voice to deliver to users. This technology is based on artificial intelligence and natural language processing, and supports a wide range of languages, voices, and tones, providing a broad range of use cases. Users can obtain high-quality voice output through simple text input, thereby enhancing information accessibility and user experience.

ESTsoft leads the innovative change in content creation environments and offers business growth opportunities in various industries. In particular, ESTsoft possesses unique domestic AI technology that perfectly and diversely creates individuals with not only a physical appearance but also a Virtual Identity, and commercializes and services it.

ESTsoft leads the innovative changes in the content creation environment and provides business growth opportunities in various industries. In particular, ESTsoft has the unique domestic artificial intelligence AI Human technology that perfectly and diversely creates characters with not only appearances but also virtual identities, and is commercializing and servicing it.

What is STV technology?

It is about changing the input video to match the shape of the mouth corresponding to the voice of the entered person and generating an output.
This includes the process of analyzing various voice characteristics such as pitch, intensity, duration, etc., and mapping this to mouth shapes to create a video.

ESTsoft leads the innovative changes in the content creation environment and provides business growth opportunities in various industries. In particular, ESTsoft has the unique AI Human technology in Korea that can perfectly and diversely create people with not only appearances but also virtual identities (Virtual Identity), and it is commercializing and servicing it.

This involves changing the input video to match the mouth shape of the person's spoken voice and generating an output. It includes the process of analyzing various vocal characteristics such as pitch, intensity, duration, and mapping these to mouth shapes to create the video.

Preprocessing stage

Preprocessing stage

  1. Data refinement

  1. Data refining

Only selects appropriate videos. There are no noises and the videos articulate well.

Only select appropriate videos. There's no noise and the video enunciates properly.

  1. Data Conversion

  1. Data conversion

Converting to a format that deep learning models can understand and process
Convert the voice into a form that can be input into the model, and extract the part where the person appears in the video

Deep learning models are converted into a format that can be understood and processed
Audio is converted into a form that can be input into the model, and video extracts the part where the person appears

Deep learning training

Deep learning training

Input preprocessed data into the model, and train by comparing deep learning outputs and correct answers.

The process involves inputting the preprocessed data above into the model, and training by comparing the deep learning output and the correct answers.

Preprocessing stage

  1. Data refinement

Selects only appropriate videos. No noise and properly spoken video

  1. Data Conversion

Converting to a format that the deep learning model can understand and process. Convert the audio into a format that can be input into the model, and extract the part where the person appears in the video.

Deep learning training

The process involves inputting the preprocessed data above into the model, and training it by comparing the deep learning output and the correct answers.

Strength of Technology

Strengths of Technology

The most significant feature of STV is that it generates lip movements that match the spoken words, not just for the original video's language or voice, but also for other languages or any other arbitrary voices. In other words, STV can accommodate a wide variety of languages and diverse voice characteristics.

The most significant feature of STV is that the lip synchronization is generated to match the spoken words, not just in the same language or voice as the original video, but also in other languages or different arbitrary voices. That is, STV can accommodate various languages and a wide range of voice characteristics.

The most significant feature of STV is that it generates lip movements that match the spoken words, not only in the original language and voice of the person in the video, but also in other languages or different arbitrary voices. In other words, STV can accommodate various languages and diverse voice characteristics.

Utilization of technology

Utilization of technology

STV is opening up new creative possibilities through the combination of voice and video technology. The advancement of this technology is expected to make the future of digital media more interesting and diverse.

STV is opening new creative possibilities through the combination of voice and video technology. The advancement of this technology is expected to make the future of digital media more interesting and diverse.

STV is opening new creative possibilities through the combination of voice and video technology. The advancement of this technology is expected to make the future of digital media more interesting and diverse.

  1. 1.

    AI senior care service that takes responsibility for seniors' Fun and cognitive enhancement with AI human technology

  2. 2.

    Celebrity instructor video lecture creation, TOEIC speaking education content production, as a fitness training instructor
    Expansion of educational businesses in various fields such as AI content

  3. 3.

    Implementing 'moving pictures' with EST AI technology, 'face transformation, makeup application, and clothing creation' through deep learning
    Creating and utilizing various AI human content such as new employee analysts, announcers, etc.

  4. 4.

    Companies can focus on their inherent customer value by providing data and solutions using AI

    as an API.

  5. 5.

    Background removal technology applied in ALSee Capture, like the smooth design of ESTsoft AI technology and ALTools products,
    provides the utility environment that users want.

  1. 1.

    AI senior care service that takes responsibility for seniors' Fun and cognitive enhancement with AI human technology

  2. 2.

    Celebrity instructor video lecture creation, TOEIC speaking education content production, as a fitness training instructor
    Expansion of educational businesses in various fields such as AI content

  3. 3.

    Implementing 'moving pictures' by applying EST AI technology, producing various AI human contents such as 'face transformation, makeup application, and clothing creation' for new employees including analysts and announcers, and utilizing them

  4. 4.

    Companies can focus on their inherent customer value by providing data and solutions using AI
    as an API.

  5. 5.

    Background removal technology applied in ALSee Capture, like the smooth design of ESTsoft AI technology and ALTools products,
    provides the utility environment that users want.

  1. 1.

    AI senior care service that takes responsibility for seniors' Fun and cognitive enhancement with AI human technology

  2. 2.

    Celebrity instructor video lecture creation, TOEIC speaking education content production, as a fitness training instructor
    Expansion of educational businesses in various fields such as AI content

  3. 3.

    Implementing 'moving pictures' with EST AI technology, 'face transformation, makeup application, and clothing creation' through deep learning
    Creating and utilizing various AI human content such as new employee analysts, announcers, etc.

  4. 4.

    We provide data and solutions utilizing AI through APIs to enable companies to focus on their inherent customer value.

  5. 5.

    Background removal technology applied in ALSee Capture, like the smooth design of ESTsoft AI technology and ALTools products,
    provides the utility environment that users want.

  1. 1.

    AI senior care service that takes responsibility for seniors' Fun and cognitive enhancement with AI human technology

  2. 2.

    Expansion of educational businesses in various fields, such as the establishment of celebrity lecture video courses, production of TOEIC speaking educational content, and AI content as a fitness training instructor

  3. 3.

    Implementing 'moving pictures' with EST AI technology, 'face transformation, makeup application, and clothing creation' through deep learning
    Creating and utilizing various AI human content such as new employee analysts, announcers, etc.

  4. 4.

    We provide data and solutions utilizing AI through APIs to enable companies to focus on their intrinsic customer value.

  5. 5.

    Background removal technology applied in ALSee Capture, like the smooth design of ESTsoft AI technology and ALTools products,
    provides the utility environment that users want.

LET'S Connect

CEO: Sangwon Jung

Business Registration Number 229-81-03214 Mail-Order Business Notification Number 2011-Seoul Seocho-1962

EST Building, 3 Banpo-daero, Seocho-gu, Seoul (Postal Code)06711

Family Site

LET'S Connect

CEO: Sangwon Jung

Business Registration Number 229-81-03214 Mail-Order Business Notification Number 2011-Seoul Seocho-1962

EST Building, 3 Banpo-daero, Seocho-gu, Seoul (Postal Code)06711

Family Site

LET'S Connect

CEO: Sangwon Jung

Business Registration Number 229-81-03214 Mail-Order Business Notification Number 2011-Seoul Seocho-1962

EST Building, 3 Banpo-daero, Seocho-gu, Seoul (Postal Code)06711

Family Site

LET'S Connect

CEO: Sangwon Jung

Business Registration Number 229-81-03214 Mail-Order Business Notification Number 2011-Seoul Seocho-1962

EST Building, 3 Banpo-daero, Seocho-gu, Seoul (Postal Code)06711

Family Site