:::

【2020 Application Example】 AI Voice Synthesis Module, Bringing Warmth to Machine Narration

In response to current trends, digital learning and mobile educational materials have attracted widespread attention!

With rapid technological advancements, effectively nurturing professionals who can 'adapt to developmental changes' is a critical concern that many businesses continually consider. Over recent years, various enterprises have progressively integrated 'digital learning' into employee training programs to enhance educational outcomes, thus bringing 'digital learning' and 'mobile educational materials' into the limelight.

Outsourced narration is costly and cannot handle large volumes of demand

Differences in the digital educational material production process before and after the implementation of the AI voice synthesis system

▸ Differences in the digital educational material production process before and after the implementation of the AI voice synthesis system

Strategic Breakthrough Corporation of Taiwan has assisted companies in converting many seminars, physical courses, and training events conducted by public sectors into digital materials in the past years. However, during the conversion process, it required inviting teachers, finding and renting filming locations, and post-production of recordings and videos. During recording, issues such as speakers' nervousness, discomfort in front of cameras, or mispronunciations might lead to poor recording quality or constant retakes.

Though there was an option to provide customer-specific educational material narration, the outsourcing costs were high and could not handle the demand efficiently. Therefore, there was a hope to introduce AI speech synthesis technology and develop an 'Intelligent Voice Synthesis Module' to instantly convert text on slides into natural, human-like voice files, thus saving on narration costs.

Realistic Intelligent Voice Synthesis Module, providing a diversified selection of voices

AI Voice Synthesis Module Illustration

▸ AI Voice Synthesis Module Illustration

Strategic Corporation of Taiwan collaborated with the AI technology team, Magic Cube Digital Ltd., using Tacotron2 combined with WaveNet and Tacotron features. Characters are embedded into Mel-scale spectrogram plots, then a modified WaveNet model acting as the vocoder synthesizes waveform in the time domain from these spectrograms, finally developing an MOS (Mean Opinion Score) for voice quality evaluation that approximates human-like intelligent voice synthesis modules.

This AI Intelligent Voice Synthesis Module, after being tested by testers using the MOS voice quality evaluation standard, received a score of 4.3, meeting the initial project target score of 4.21 and surpassing WaveNet's score of 4.08, thereby demonstrating exceptional effectiveness!

AI Intelligent Voice Synthesis Module, reducing costs and increasing profits, will effectively enhance Taiwan's digital learning industry environment!

Costs have been significantly reduced after the implementation of the AI voice system, and profits have increased relatively

▸ Costs have been significantly reduced after the implementation of the AI voice system, and profits have increased relatively

This AI Intelligent Voice Synthesis Module not only reduces the cost of producing digital educational materials but also solves the difficulties faced by Taiwan's industry, government, and academia in spreading digital educational materials. It can effectively enhance the efficiency of customers in producing digital teaching materials, significantly reduce labor shortages, and cost structural risks, and improve profitability.

Strategic Corporation of Taiwan will also continue to develop the 'Intelligent Transcription Module' and introduce Robotic Process Automation (RPA) to replace the current manual processes, such as captioning, dubbing, and file conversion in the production of digital educational materials, assisting in the transformation and enhancement of the domestic digital learning industry.

「Translated content is generated by ChatGPT and is for reference only. Translation date:2024-05-19」

Recommend Cases

這是一張圖片。 This is a picture.
Realizing the dream of unmanned stores, Magpie Life is building the future of the smartphone industry

"The DNA of Magpie Life is not limited to vending machines We believe that vending machines combine technology, access, and humanities to bring us exciting results" This is a sentence on the official website of Magpie Life Let the vending machines bring To live a pleasant life and build a considerate, technological and sustainable future for the smartphone industry is also the original intention of Magpie Life Founded in 2018, Magpie Life launched Taiwan’s first private-brand mobile payment scan code sensor 4 months after its establishment, completing the consumption experience through screen touch The Magpie U1 smart vending machine manages the POS system and gathers data in the background, allowing consumers to synchronize with the world's new retail pace and experience a new retail consumption experience of purchasing convenience, checkout security, visual entertainment, and improved logistics replenishment efficiency Traditional vending machines lack information visibility and AI technology assists in information transparencyThis time, the Magpie smart vending machine is also equipped with AI technology to provide adjustable shelf space , a vending machine equipped with an industrial computer and a large-size touch display screen to achieve the purpose of a store-less store Magpie Life stated that the biggest problem with traditional vending machines is the lack of information visibility To check inventory, replenishment personnel must physically inspect each machine, which is time-consuming and costly When a machine breaks down, it will generally be unable to operate for a long time Most failures go unreported and are not discovered until the next restocking crew arrives to replenish supplies Then you have to wait for a service technician to be scheduled, which can take weeks Traditional vending machines lack real-time interactivity When consumers encounter problems after inserting coins, manufacturers cannot handle them immediately In addition, traditional vending machines are less flexible and cannot adapt to changes in consumer preferences Traditional vending machines have shortcomings such as limited change shopping, single payment tools, limited number of products, and few choices Affected by the COVID-19 epidemic, consumption habits have shifted to contactless methods, causing the unmanned store market to heat up Generally, vending machines can only place relatively simple products such as drinks, food, etc The properties available for sale are limited The patented vending machine developed by Magpie can adjust the shelf space and is equipped with a lifting cargo elevator, which is suitable for various types of goods In addition, the machine is equipped with an industrial computer and a large-size touch display screen, which can meet the needs of advertising support at the same time It is expected to move towards a storeless store According to Magpie Life Observation, the consumer market trend in the past two years is that consumers demand convenient life, food consumption patterns value dining experiencesimple and fast, and are equipped with mobile phone-connected ordering models, and hot drinks and Fresh food delivery is the focus of two major trends The location, items sold, consumption methods and multiple payment methods are the focus of market growth for smart vending machines In terms of convenience, Taiwanese consumers still prefer to purchase vending machine food near stations, airports, schools, and businesses in business districts Various payment methods are also gaining more support from consumers, indicating that in the future, automatic Vending machines can be developed in two directions diversified items and diversified payment methods AI sales forecast technology integrates back-end management to achieve precise marketing purposesDue to the wide variety of products, it is difficult to know the performance of products under different factors such as season, market conditions , promotional activities, etc, it is easy to cause out-of-stock or over-inventory situations Magpie Life has specially developed "AI sales forecasting technology" and integrated it into the back-end management system, hoping to lock in customer purchasing preferences and intentions through data analysis In order to achieve the purpose of precise marketing, make accurate business decisions and effectively allocate limited resources The introduction of AI systems can achieve the three major goals of precise marketing, inventory management and supply chain management This system is a replenishment decision-making aid designed specifically for supply chain managers It uses AI to predict future sales demand, helping companies effectively optimize production capacity, inventory and distribution strategies Its overall system architecture includes1 Data exploratory analysis function Provides automatic value filling, automatic coding and automatic feature screening functions for missing values in the data 2 Modeling function 1 Provides model training functions for two types of prediction problems regression Regression and time series Time Series Forecast nbsp2 Supports Auto ML automatic modeling, and the best model is recommended by the system Integrated models can also be established to improve model accuracy nbsp3 Supports multiple algorithm types Random Forest, XGBoost, GBM and other algorithms nbsp4 Supports a variety of time series models exponential smoothing, ARIMA, ARIMAX, intermittent demand, dynamic multiple regression and other models nbsp5 Supports a variety of model evaluation indicators R, MAE, MSE, RMSE, Deviance, AUC, Lift top 1, Misclassification and other indicators nbsp6 Supports automatic cutting of training data sets and Holdout verification data sets, and can manually adjust the ratio nbsp7 Supports automatic model ensemble learning Stacked Ensemble, balancing function learning Balancing Classes, and Early Stopping nbsp8 Supports the creation of multiple models at the same time The system will allocate resources according to modeling needs, so that modeling, prediction and other tasks have independent computing resources and do not affect each other In the overall server space With an upper limit, computing resources can be used efficiently nbsp9 It has in-memory computing function, which can use large-capacity and high-speed memory to perform calculations to avoid reading and writing a large number of files from the hard disk and improve computing performance 3 Data concatenation function Using API grafting and complete data concatenation automation, there is no need to manually import data, improving user experience 4 Chart analysis function Provides visual charts and basic statistical values for product sales AI data analysis solutions have two major advantages 1 Entrepreneurship machines can be rented and sold at low cost to open unmanned physical stores and cooperate with the chain retail industry Through smart machines, entrepreneurs can rent and sell them at a lower cost than the store rent Cost of running a retail business Two cooperation models, machine sales and leasing, are provided, and the choice is based on the evaluation of the industry 2 Various types of products are put on the shelves Products are sold anytime and anywhere 24 hours a day Up to 60 kinds of diversified products can be put on the shelves Large transparent windows enhance the visibility of products Regular replenishment and tracking of product sales status are available, and product types can be adjusted according to needs Recently, the line between the Internet and the physical world has blurred, the way customers interact has changed significantly, and consumer demand is changing and personalized The retail industry is facing unprecedented challenges and uncertainties, and mastering data has become key AI data analysis solutions can help the retail industry quickly activate large amounts of data, create seamless personalized experiences, optimize the operational value chain and improve efficiency, and strengthen the core competitiveness of enterprises 「Translated content is generated by ChatGPT and is for reference only Translation date:2024-05-19」

【導入案例】赫銳特科技VCSEL封裝元件瑕疵導入AOI檢測 提升產能效率20
HRT Technology Improves Production Efficiency by 20% Through AOI Detection of Defects in VCSEL Packaging

In 2017, the launch of the iPhone X made 3D sensor technology used in Face ID highly popular, which drove the development of VCSEL, a core component in the 3D sensor module In the detection of defects in incoming packaged VCSEL, the use of AI inference models can solve the industry's issue with low yield and improve reliability to 95 VCSEL technology currently can be used in many applications and various end consumer markets, including robots, mobile devices, surveillance, drones, and ARVR VCSELs are a good solution in applications that require high-speed modulation capabilities, such as cameras and biometrics VCSEL technology has a wide range ofnbsp applications, including in drones Pictured Zoyi Technology's Agricultural Drone VCSEL technology has a wide range of applications, AI technology assists in defect detection HRT Technology stated that the packaged VCSEL market is also facing strong price competition from competitors, and needs to further reduce costs and enhance product competitiveness One of the key problems is the replacement of glass lens with epoxy resin lens The production of traditional glass lenses has high yield, but the cost is higher than that of epoxy resin lenses Due to the cutting process of epoxy resin, the side wall of cutting lines can easily have rough edges, causing it to be oversized The release of stress caused by heat during the mounting process will directly cause the optical lens to break HRT Technology pointed out that the incoming inspection of VCSEL epoxy resin lenses is very important Under the constraints of packaging space, the space for fitting the package and optical lens is limited Moreover, the optical lenses will be confined to a metal frame If the dimensional tolerances are properly controlled, stress release due to heat during mounting can easily cause the optical lens to break, resulting in a yield loss of up to 10 in the VCSEL package reliability verification, resulting in an increase in production costs In order to solve the problems above, HRT Technology hopes to use AI to monitor the size and appearance defects of epoxy resin components in the VCSEL epoxy resin lens incoming stage, verifying whether their dimensions meet specifications, whether the cutting edges are smooth, and whether there are any defects in their appearance Since traditional incoming material inspection requires a rough visual inspection by humans to distinguish the quality The problem of image collection needs to be solved first to successfully collect image data Therefore, HRT Technology first developed an Automated Optical Inspection AOI device, which includes X, Y, Z three-axis motion, high-resolution cameras, and related control software to automatically record images After collecting the image data, opencv aligns the test image and a normal image to determine differences between the two images, and then pixel mapping is used to compare the pixel area to complete initial screening Manual labeling is carried out according to the image classification above, including samples that are normal, have defects in appearance, or have different shape characteristics, and then algorithm training and verification is carried out Residual neural network ResNet or other related algorithms are used for deep learning to identify the quality of lenses Implementation of AOI inspection improves production efficiency by 20 and above Comparing the differences before and after the implementation of AI image inspection, the incoming VCSEL lens inspection before implementation only involved manual inspection of the appearance The lens is packaged on the VCSEL package that has completed die bonding After passing the general light up test, the final reliability test high temperature reflow is performed Failed samples go into the rework process However, after the implementation of AOI inspection, it can screen defective lenses sooner and reduce the cost of subsequent materials input, it can also reduce the need for rework due to failure, improving yield to 95 and above in the reliability verification This is expected to help companies reduce production costs by 10 and increase production efficiency by 20 and above The difference before and after implementing AI image detection HRT Technology pointed out that this technology is an AI application developed based on tiny images It uses deep learning algorithms to identify defects in the images The trained network automatically classifies image data to predetermined categories Defect categories can be determined through reference images, so cumbersome programming is not required In the industrial machine vision environment, deep learning is mainly used for classification tasks in applications, such as inspection of industrial products or identification of parts In the future, with the development of IoT wearable devices and the trend of energy saving, the size of optoelectronic components will continue to shrink This technology can be applied to the detection of defects in the appearance of other tiny optoelectronic components in the future

【解決方案】佐翼科技無人機導入高爾夫球場域 可節省一半人力
Droxo Tech Applies Drones in Golf Courses to Reduce Manpower by Half

For most golf courses, the operations and management is a headache "Golf courses are selling turf and need to be properly taken care of," a golf course manager bluntly pointed out Facing the market pain points of labor shortage, aging population and high cost, the use of AI drones for pesticide spraying and pest control will reduce labor costs by more than half and greatly improve the overall operational efficiency At noon in early summer, an AI drone is slowly taking off at the Taipei Golf Club in Taoyuan Its main task is to test AI drone fertilizing and pesticide spraying on the golf course In fact, drones of Droxo Tech, the company performing this task, are widely used for fertilization, pesticide spraying, and pest and disease control for rice, bananas, and tea trees For golf courses with turfs that often cover tens to hundreds of hectares, AI drones are needed to assist in turf maintenance Data collection, development of pesticide spraying AI models, and multispectral image analysis and testing will be carried out in the current stage In the future, large-scale technology implementation and verification will be carried out to set an example for applying drones to golf courses Using AI drones to fertilize and spray pesticides can reduce the manpower required by half The traditional way of maintaining the turf in golf courses is to carry spray buckets or drive spraying vehicles to spray areas one by one "Domestic golf courses began to plant ultra-dwarf Bermuda grass in 2001 This grass species prefers a cool climate and is not suitable for Taiwan's hot and humid weather" Droxo Techrsquos CEO further pointed out that to prevent turf from pests and diseases, pesticide spraying is necessary For an 18-hole golf course, it is equivalent to spraying pesticides once a week, and the T-ground and fairways are sprayed every two months For golf courses, spraying pesticides is time-consuming and labor-intensive It is important to note that large-scale spraying will increase the risk of personnel poisoning and increase the amount of pesticide used Benefits of applying agricultural drones to golf courses According to Droxo Techrsquos research, golf course pests include Spodoptera litura, which comes out at night to look for food, so pesticide spraying must be carried out in the evening According to the traditional method, pesticide spraying requires two vehicles and three personnel for a total of 45 hours If AI drones are used for fertilizing and pesticide spraying, it only takes one operator to spray 08 hectares of land in 20 minutes, saving about two-thirds of the manpower and reducing operating costs by about 30 Using AI drones to fertilize and spray pesticides on golf courses can reduce the manpower required by half In addition to the significant benefits of using agricultural drones for golf course turf maintenance, Droxo Tech also specially introduced AI multispectral image recognition for NDVI Normalized Difference Vegetation Index analysis "The so-called multispectral is to direct light with different wavelengths on the turf, and the reflected images are collected for analysis" Droxo Tech CEO Liu continued to explain that each plant absorbs light with different wavelengths, so multispectral imaging can determine the growth status of grass species At the same time, combined with AI image recognition, the distribution of pests and diseases can be accurately detected, and the amount of pesticide used is determined on this basis Cross-domain collaboration to build a multi-source turf image databasenbsp Using AI multispectral image recognition technology, Droxo Tech will collect visible light, multispectral, thermal images, and hyperspectral images to establish a multi-source turf image database to fully understand the growth cycle of Bermuda grass Droxo Tech has accumulated rich experience in agricultural AI drone pesticide spraying , but there are still many problems that need to be overcome to implement AI solutions in golf courses For example, it is necessary to establish a new pesticide spraying model and test flight methods, especially the application of multispectral image recognition PoC is not difficult, but actual implementation requires more test evidence, repeated inferences, and collaboration with plant experts This part must rely on the cross-domain integration of legal entities such as the Institute for Information Technology III, gathering more fields for verification, and creating a paradigm before it can be more widely adopted by golf courses There are not many international cases on the application of AI drones in golf courses During the verification process, it is not yet known whether it can be quickly copied to the next golf course However, Droxo Tech CEO Liu believes that through cross-domain collaboration, clearly defining the problems and listing them one by one, supply and demand parties can reach a consensus, propose solutions to each problem, and seek cooperation with internal and external resources Only then will we be able to gradually achieve the goal of making golf courses smarter and smoothly assist the industry with transformation Zuoyi Technology's CEO, Liu Junlin 「Translated content is generated by ChatGPT and is for reference only Translation date:2024-05-19」