【2020 Application Example】 AI Voice Synthesis Module, Bringing Warmth to Machine Narration

Editorial Team, 2020-06-02

In response to current trends, digital learning and mobile educational materials have attracted widespread attention!

As technology advances rapidly, many businesses keep returning to one critical question: how to train professionals who can 'adapt to change'. In recent years, enterprises have progressively integrated 'digital learning' into employee training programs to improve learning outcomes, bringing 'digital learning' and 'mobile educational materials' into the spotlight.

Outsourced narration is costly and cannot handle large volumes of demand

▸ Differences in the digital educational material production process before and after the implementation of the AI voice synthesis system

In past years, Strategic Breakthrough Corporation of Taiwan has helped companies convert many seminars, in-person courses, and training events held by public-sector organizations into digital materials. The conversion process, however, required inviting instructors, finding and renting filming locations, and post-producing the audio and video. During recording, a speaker's nervousness, discomfort in front of the camera, or mispronunciations could lead to poor recording quality or repeated retakes.

Although narration tailored to each customer's materials could be outsourced, the cost was high and outsourcing could not keep up with large volumes of demand. The company therefore hoped to introduce AI speech synthesis technology and develop an 'Intelligent Voice Synthesis Module' that instantly converts the text on slides into natural, human-like voice files, saving on narration costs.

Realistic Intelligent Voice Synthesis Module, providing a diversified selection of voices

▸ AI Voice Synthesis Module Illustration

Strategic Corporation of Taiwan collaborated with the AI technology team at Magic Cube Digital Ltd., using Tacotron 2, which combines the strengths of WaveNet and Tacotron. A sequence-to-sequence network first maps character embeddings to Mel-scale spectrograms, and a modified WaveNet model acting as the vocoder then synthesizes time-domain waveforms from those spectrograms. The result is an intelligent voice synthesis module whose voice quality, as measured by MOS (Mean Opinion Score), approaches that of human speech.
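To make the intermediate representation concrete, here is a minimal Python sketch of what a Mel-scale spectrogram is. It is not the project's code: in Tacotron 2 the network predicts such a spectrogram from text, whereas this sketch computes one from an existing waveform. The sample rate, frame size, hop length, number of Mel bands, and the 440 Hz test tone are all illustrative assumptions, and a naive DFT from the standard library stands in for the FFT a real pipeline would use.

```python
import cmath
import math


def hz_to_mel(hz):
    """Convert a frequency in Hz to the Mel scale (common 2595*log10 formula)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters spaced evenly on the Mel scale, one row per band."""
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    top = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(top * i / (n_mels + 1)) for i in range(n_mels + 2)]
    bank = []
    for m in range(n_mels):
        left, center, right = edges[m], edges[m + 1], edges[m + 2]
        bank.append([
            max(0.0, min((f - left) / (center - left),
                         (right - f) / (right - center)))
            for f in bin_hz
        ])
    return bank


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for the non-negative frequency bins."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def mel_spectrogram(wave, sample_rate, n_fft=256, hop=128, n_mels=20):
    """Return a list of frames, each a list of log Mel-band energies."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / n_fft) for t in range(n_fft)]
    bank = mel_filterbank(n_mels, n_fft, sample_rate)
    frames = []
    for start in range(0, len(wave) - n_fft + 1, hop):
        frame = [wave[start + t] * window[t] for t in range(n_fft)]
        mag = magnitude_spectrum(frame)
        # Project the linear-frequency magnitudes onto the Mel bands, then
        # apply log compression with a small floor to avoid log(0).
        frames.append([
            math.log(max(1e-5, sum(w * a for w, a in zip(row, mag))))
            for row in bank
        ])
    return frames


# Hypothetical input: 0.25 s of a 440 Hz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(2000)]
mel = mel_spectrogram(tone, sample_rate)

# Index of the Mel band with the highest average log energy.
averages = [sum(frame[m] for frame in mel) / len(mel) for m in range(len(mel[0]))]
peak_band = averages.index(max(averages))
print(len(mel), len(mel[0]))  # 14 20
print(peak_band)
```

Each entry of `mel` is one time frame and each value within it one Mel band. A vocoder such as WaveNet is conditioned on a frame sequence of this kind when it generates the audio samples.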

Evaluated by testers against the MOS voice quality standard, this AI Intelligent Voice Synthesis Module scored 4.3, meeting the project's initial target of 4.21 and surpassing WaveNet's score of 4.08, thereby demonstrating its effectiveness!
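As a rough illustration of how an MOS figure is obtained, the Python sketch below averages listener ratings on the usual 1 to 5 scale and compares the result with the two reference scores quoted above. The ratings are made up for the example (chosen so that they happen to average 4.3) and are not the project's test data; the function name and the normal-approximation confidence interval are likewise assumptions, and the interval is only rough for so few ratings.

```python
import math
import statistics


def mean_opinion_score(ratings):
    """Return (MOS, approximate 95% CI half-width) for 1-5 ratings."""
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("MOS ratings must lie between 1 and 5")
    mos = statistics.mean(ratings)
    # Normal approximation: 1.96 standard errors on either side of the mean.
    half_width = 1.96 * statistics.stdev(ratings) / math.sqrt(len(ratings))
    return mos, half_width


# Hypothetical ratings from ten listeners (not real evaluation data).
ratings = [5, 4, 4, 5, 4, 4, 5, 4, 4, 4]
mos, half_width = mean_opinion_score(ratings)

PROJECT_TARGET = 4.21  # initial project target quoted in the article
WAVENET_SCORE = 4.08   # WaveNet score quoted in the article

meets_target = mos >= PROJECT_TARGET
beats_wavenet = mos > WAVENET_SCORE
print(f"MOS = {mos:.2f} +/- {half_width:.2f}")
print(meets_target, beats_wavenet)
```

Reporting the interval alongside the mean shows how much a score based on a small listener panel could move if the test were repeated.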

AI Intelligent Voice Synthesis Module, reducing costs and increasing profits, will effectively enhance Taiwan's digital learning industry environment!

▸ Costs have been significantly reduced after the implementation of the AI voice system, and profits have risen accordingly

This AI Intelligent Voice Synthesis Module not only reduces the cost of producing digital educational materials but also addresses the difficulties that Taiwan's industry, government, and academia face in distributing them. It can make customers more efficient at producing digital teaching materials, significantly reduce the risks posed by labor shortages and by the cost structure, and improve profitability.

Strategic Corporation of Taiwan will also continue to develop the 'Intelligent Transcription Module' and introduce Robotic Process Automation (RPA) to replace current manual steps in producing digital educational materials, such as captioning, dubbing, and file conversion, thereby helping to transform and upgrade the domestic digital learning industry.

「Translated content is generated by ChatGPT and is for reference only. Translation date：2024-05-19」

