:::

【2020 Application Example】 AI Address Parsing, No More Hitting Walls in Searching for Coordinates

Empower addresses with spatial coordinates to help drive the 'Open Data' policy

In recent years, the government has been promoting 'Open Data' hoping that the openness of data will facilitate inter-agency data flow, enhance administrative efficiency, meet public needs, and strengthen public oversight of the government. Among them, transportation data is closely related to daily life, often reported by the public with the incidents specifying obvious local landmarks or addresses; there have also been public feedback about the traffic reports on police radio that lacked actual coordinates. Introducing these addresses, which were originally without spatial attributes, into the geographical coordinate system is one step toward 'Smart Spatial Decision Making'.

However, unstructured addresses, without manual intervention to improve the inconsistency of address formats, do not yield high location accuracy, necessitating an improvement in data quality and usability to unlock the potential applications of open data. This further aids in policy promotion and widespread application to different sectors including tourism, employment, birth and adoption.

Unregulated and diverse writing styles of addresses lead to low location accuracy

Address Locator is jointly developed by SongXu Information Co., Ltd. and YanDing Intelligent Co., Ltd. GOLiFE as a 'stand-alone address locating software' providing single or batch address location services. To imbue address data with spatial attributes, the core technology of Address Locator involves 'Address Parsing' and 'Address Location' in two stages. Initially, 'Address Parsing' distributes the addresses aimed for positioning according to administrative region hierarchy keywords: province/city, township/district, village, road/street, alley, lane, number; subsequently, 'Address Location' matches the split addresses with the parent address to obtain the location level and corresponding coordinates.

However, in the actual business integration process, since address sources are maintained separately by different authorities, a lack of consistent standards remains a common issue. Problems include: special characters (at address examples in specific regions), omitted administrative units, repetitive administrative hierarchical keywords, special street-alley segments, mismatch in Chinese numericals vs. Arabic numerals, and non-current addresses leading to complex address formats that are difficult to accurately split.

Establishing an address tokenization model, achieving precise location alignment!

To effectively handle various messy address formats and alleviate the difficulties in location alignment for the existing Address Locator, AI and Natural Language Processing technologies are implemented for 'Address Normalization' and a 'Chinese Tokenization Tool' to optimize existing address location capability. 'Address Normalization' addresses the issues of missing keywords, variant character forms, and missing administrative areas; whereas 'Chinese Tokenization Tool' helps resolve 'split errors' caused by special address formats, preventing unsuccessful positioning.

Successful address parsing through AI tokenization technology

▲ Successful address parsing through AI tokenization technology

In the past, while handling address location services, manual preprocessing for data standardization was required, hence it was not solely marketed as a product, but included in project plans that offered address location services. However, after incorporating address normalization and AI tokenization technology, it has become a complete product, significantly reducing the time users spend on manual adjustments and achieving the intended location accuracy. Furthermore, the AI-enhanced Address Locator is now introduced on the SongXu Information Co. Ltd. website, including product descriptions and official listings.

After four months of testing and modifications, AI technology was successfully incorporated into the existing address location product. From selecting the tokenization tools, building the corpus, training the model, and interfacing with product features, to complete test planning, collection from 'Government Data Open Platform' and 'Taichung City Government Data Open Platform,' including over 62 datasets and more than 300,000 addresses, achieving a complete match rate of 90.08% and a fuzzy match rate of 98%, greatly surpassing the original product in match rates and processing time!

To promote AI technology applications in the information services sector, the AI-enhanced address location service is positioned as a new solution and showcased on the SongXu company website; starting from product function introductions, explaining address regularization methods and address location features; subsequently, guiding potential customers to envision applicable scenarios including: decision analytics, precision marketing, and other applications. The product will aid various sectors’ data by assigning spatial information to addresses, delving into the context and trends of data in two-dimensional space.

Address Location Solution

▲ Address Location Solution

Providing spatial coordinates for attractions, intersections, and points of interest

Successful development and implementation of AI-enhanced products in companies focused on smart transportation systems in the domestic market revealed that, while effectively solving address location issues, they also recognized that descriptions of spatial information, beyond addresses inclusive. During their progress, integrating AI more broadly into 'Entity Recognition' is set to be an important future application not limited to address location. In an era of information overload, collecting data is straightforward; identifying keywords of interest is key. Future development directions aim to optimize these products and create more business opportunities!

「Translated content is generated by ChatGPT and is for reference only. Translation date:2024-05-19」

Recommend Cases

這是一張圖片。 This is a picture.
[2023 Case Study] AI Steps into Philanthropy: Stylish Tech at Food Banks

Taiwan Food Bank AssociationHereinafter referred to as 'the Association'With the mission of providing food aid, poverty relief, reducing food waste, and building a hunger-free network, there are locations across Taiwan that gather donations from wholesalers, intermediaries, retailers, manufacturers, and even generous individuals These sites also rescue food that would otherwise be discarded, properly allocate and distribute it to needy households, thus aiding local vulnerable families55Food banks at various locations collect daily donations from wholesale stores, intermediaries, retailers, manufacturers, and even benevolent individuals from all over Taiwan These places also rescue about-to-be-discarded edible materials, properly sort them, and distribute to needy households, assisting local vulnerable populations However, each location requires significant human and volunteer resources to manage daily operations using traditional methods of communication with non-profit organizations and donors After receiving donations, these resources are then allocated to needy families or individuals There is a potential issue of uneven distribution of resources due to a lack of digitalization and integrated information management in these processes Warehouse and Transportation Centers and Mini Food Banks Distributing Resources to the Disadvantaged The location under validation by the Kaohsiung Charitable Organizations Association,Hereinafter referred to as 'Kaohsiung Charity' In109year6month24Officially inaugurated Taiwan's first 'Food Bank-Warehouse and Transportation Center' at a location measuring200square meters, enhancing the efficiency of food resource redistribution, proper storage, and management So far, nearly two hundred tons of vegetables and fruits have been saved, serving over a hundred organizations and benefiting over5thousand vulnerable households, and continues to serve19mini food banks, with planned completion across multiple districts in Kaohsiung, distributing food resources to over10ten thousand vulnerable families Kaohsiung Charity 'Food Bank-Warehouse and Transportation Center' in the Dasha Community Photo Source Kaohsiung Charitable Organizations Association Challenges in Labor and Food Resource Management Facing the needs of a large number of economically disadvantaged families, the management of the 'Food Bank-Warehouse and Transportation Center' is particularly critical During procurement, tasks such as sorting, purging, and bookkeeping must be performed, while during shipment, food resource needs suggested by social workers must be followed These activities rely on manual judgment and accumulated experience Many volunteers involved are elderly and have limited physical strength, making warehouse tasks physically demanding and recruitment challenging If a large batch of food resources arrives, space and manpower are consumed in sorting and inventory management, raising concerns about the effective use of resources and turnover rate This highlights the challenge of scaling up food bank services while lacking corresponding labor and material management systems At the same time, food bank resources come from various donations, thus they vary greatly in type, shelf life, standards, and quantity Volunteers at mini food banks, mostly also elderly, must handle multiple responsibilities such as case services, food resource management,resource allocation, and resource development Sometimes they must also explain and accept immediate, large quantities of specific resources, such as adults receiving baby formula 'Food Bank-Warehouse and Transportation Center' Resource Inventory Relies Entirely on Manual Labor Mini Food Bank Volunteers Handle Multiple Responsibilities Photo Source Taiwan Food Bank Association Reducing Scrap Resources60 Increasing Speed of Resource Transfer80 To enhance resource management and ensure effective use of materials, and to address personnel shortages, this field validation case has introduced 'Food Bank Warehouse Resource CollectionAITo advance resource management, ensure effective use of resources, and solve manpower shortages, this validation site has implemented an 'Automated Early Warning Needs Assessment System' for the food bank's warehouse resource gathering The first part involves building a classification model, setting up and collecting warehouse information at the site, andAItraining the model Past sitewarehouse information is collected and stored in a database, allowingAIfor preprocessing, classification, and other tasks At the same time, depending on the dependency conditions of the types of goods as features, algorithms are introduced for computation and modeling, and the data collected is used for retraining, ultimately validating the field and organizing data for the five most common types of goods into training and test datasets as required The second part involves constructing the classification model using AI techniques further use of reinforcement learning constructs the management mechanism for the food bank's warehouse, perfecting the classification of donated goodsRNNTechnical construction of classification models further use of reinforcement learning constructs food bank warehouse management mechanisms, making the classification of donated goods perfectlike white rice, instant drinks, noodles, instant noodles, and canned goodscan then be automatically assigned storage based on storage assignment principles AI Service System Process and Description Source Taiwan Food Bank Association AtAIUnder forecasts, it can optimize the speed of resource transfer and allocation, effectively and accurately match resource donations reducing the loss in the donation process, increase the accuracy of resource distribution, and improve the service rate—the successful donation rate—reducing the waste of resources due to incorrect items, and enabling instant monitoring of food resource stock, ensuring operators can respond quickly to needs, effectively providing resource assistance WithAIthe system's introduction and the establishment of data intelligence, it helps the operations of the warehouse and transportation center, allowing more time for the allocation of donated goods The introduction aims to accelerate the digital service rollout for social welfare organizations, thoroughly addressing the needs of the overall vulnerable segments of society Using the system for resource allocation and dispatching Photo Source Kaohsiung Charitable Organizations Association Following this field validation, it is possible to expand the system to other food bank service pointsAIThe system can also collaborate with more non-profit organizations, public welfare groups, and charitable organizations, expanding 'Food Bank Warehouse Resource CollectionAIAutomated Early Warning Demand Assessment System' application range such as medical supply distribution, helping more organizations manage and distribute more intelligently, reducing resource wastage, and enhancing social welfare 「Translated content is generated by ChatGPT and is for reference only Translation date:2024-12-12」

這是一張圖片。 This is a picture.
CCTV Intelligent Video Search System

Search for a specific person, find someone with a suitcase entering the factory in Gao'an area Color features of the person and the object confirmed, person in blue and black top, suitcase in black color, throughCCTV the intelligent video search system, by setting object and color retrieval conditions, it can successfully locate three video clips containing the target subject This greatly aids operational staff in finding the target items, and through this system, search speed can far surpass manual effort6fold Pain Points The CSE-Kaohsiung Plant is densely equippedCCTVto monitor every corner of the plant area, but when an incidenthappens, it's impossible within a limited time throughCCTVvideo playback to find the incident, the implications and risks behind this are self-evident Many areas that are usually unmanned can easily become security blind spots Thus, how to monitor a vast plant area more intelligently and effectively is one of the crucial aspects of building a smart plant for the semiconductor industry The AES Plant in Kaohsiung covers a vast area, with many important sites requiring monitoring of personnel movements to ensure corporate secrets and employee safety 1 Automated production lines and warehouses In semiconductor enterprises’ automated production lines and warehouses, oftenAGV(Automated Guided VehicleAGVs automated guided vehicles travel at high speeds if plant personnel inadvertently enterAGVthe moving area and cannot issue a warning to the person, then the regrettable accidents that occur will be too late to reverse 2 Material and product storage areas Materials used in semiconductor-related processes are costly if areas storing materials or products are breached, there is a risk of loss of high-value materialsproducts 3 High-security areas Trade secrets relate to the core technological competitiveness of semiconductor-related enterprises if someone breaches the high-security areas, there is a risk of corporate secrets being leaked The safety of trade secrets has always been one of the most critical issues for semiconductor enterprises 4 Loading docks At AESLButthe dock area often has loading vehicles coming and going if someone intrudes into the dock area, there is a risk of vehicle collisions and accidents Additionally, goods awaiting shipment at the dock area could be stolen or potentially damaged from collisions, thus causing significant reputation and financial losses for the company, further leading to production and shipping inconvenience When an abnormal event occurs, how to quickly search for the relevant key footage from massive data Many important locations within the AES Kaohsiung Plant need to be equippedCCTVfor safety checks, butCCTVWith thousands to tens of thousands of cameras, manually searching through footage for an event requires laborious frame-by-frame review which is time-consuming and inefficient In light of advancements in computer vision, it's beneficial to utilizeAIto replace manual playback and searching Problem Scenario Object Detection The data source for object detection comprises two parts Open-source datasetsOIDv4and AES Kaohsiung PlantCCTVImage files For these files, search for usable data, specificallyOIDv4image files For these files, extract the defined nine major categories of objects for training data among them, two object categories, knives and gasoline barrels, were not found inOIDv4found usable data for knives and gasoline barrels, while the remaining seven categories of objects are available fromOIDv4useful training data found for the remaining seven categories of objects, all marked Regarding the Kaohsiung PlantCCTVimage files, select some frames Frame of the footage, and manually annotate the objects to be_detected for training and testing data Nine Major Objects Color Recognition The data source for color recognition is divided into two partsInternet image screenshots, and Kaohsiung PlantCCTVimage files Currently, no publicly available open-source datasets specifically for color recognition applications have been found, so images are collected from the web Search the web for images of the defined nine major object categories, save the images after separating the objects from the background, keeping only the object sections, and mark the images according to color Additionally, for the Kaohsiung PlantCCTVimage files, use the already-markedbounding boxextractCCTVimage files from variousFramesections of objects identified by color, and finally, visually identifiable images are marked according to color Each object category has its specific color definition, depending on the usual colors seen in these objects in real life Dynamic Ignore during Training FromOIDv4during the training of the object detection pilot model, since each image in this dataset is only marked for a single category, but the image may contain other desired detection categories unmarked For such cases, dynamic ignore techniques will be employed during training to avoid confusion Next, use the extracted training data from the Kaohsiung Plant toFine-Tuneenhance the detection rate of the object in specific designated areas Finally, select the model that computes the lowest loss value in the test set during the training process as the main object_detection model Dynamic Ignoring AIHelp You View CCTV The intelligent video search system primarily serves as an assistive system for searching surveillance footage, capable of speeding up the process of finding target events by setting search conditions for objects By simply defining the search conditions, you can quickly produce thumbnails of critical objects and playback for review, shortening the time required for manual case retrieval of the past The search time is quickly6doubled, allowing the front-end security unit to use this platform to strengthen the first line of risk management supervision and take timely preventive measures 「Translated content is generated by ChatGPT and is for reference only Translation date:2024-12-12」

【導入案例】防患於未然 麗臺科技研發心臟衰竭AI辨識技術可及早發現病徵
Preventing Problems Before They Arise: Leadtek Research Develops AI Technology for Early Detection of Heart Failure Symptoms

With the increase in the elderly population, the incidence of various chronic diseases is rising daily Among these, heart failure is not only a silent killer it has a very long disease course with a high recurrence rate, leading to increased burden on healthcare personnel However, by using medically certified electrocardiography acoustics devices, coupled with AI predictive assessment of heart failure risk and remote care systems, diagnosis can be aided significantly, helping doctors make accurate diagnoses for subsequent patient medical care or referrals Heart failure has a lengthy course and medical expenditure is five times that of diabetes If you find yourself short of breath even with minimal movement, or if you wake up from sleep needing to sit up to feel comfortable, or if you have symptoms such as swollen lower limbs, anxiety, restlessness, fatigue, or a loss of appetite, be cautious These could be signs of heart failure According to statistics, there are about 60 million people with heart failure worldwide, with 5 million new cases every year In China, nearly 290 million people suffer from cardiovascular diseases, accounting for the second leading cause of death among urban residents around 12 million of these are heart failure patients, accounting for over 59 of cardiac-related deaths The disease course of heart failure is exceptionally long, and both its recurrence and rehospitalization rates are exceedingly high, resulting in medical costs that are twice that of hypertension and five times those of diabetes According to US research statistics, the 30-day mortality rates for patients with myocardial infarction and heart failure are respectively 166 and 111, and the rehospitalization rates within 30 days are 199 and 244 The symptoms of heart failure, because they are similar to those of other diseases such as chronic obstructive pulmonary disease and asthma, have an 185 misdiagnosis rate, which poses a challenging problem for healthcare institutions Leadtek, a major graphics card manufacturer, has been investing in the medical and healthcare sector since 2000 Following two heart attacks in 2011 and 2015 experienced by Chairman Lu Kunshan, Leadtek has focused on health big data, independently developing AI technology for heart failure recognition This AI application reads patients' electrocardiograms and phonocardiograms to perform anomaly detection and model prediction of heart failure risk, enabling early detection of disease symptoms Leadtek independently developed heart failure AI recognition technology to predict medical history and risk Leadtek's independently developed heart failure AI recognition technology has the following three judgment functions 1 Prediction of heart failure history Classifies electrocardiogram and phonocardiogram data into 'with hospitalization history of heart failure' and 'no history of heart failure' 2 Risk prediction of heart failure Provides a predictive risk value of heart failure occurrence based on the electrocardiogram and phonocardiogram data 3 Prediction of heart failure recurrence risk For patients with heart failure, it reads their phonocardiogram and electrocardiogram data, assessing the risk prediction of heart failure recurrence Leadtek states that the application of heart failure AI recognition technology can assist doctors in making more efficient and accurate diagnoses, facilitating subsequent medical treatment or referrals for patients As an instance, in studies of heart failure patients discharged from Taipei Veterans General Hospital, using the EMAT Electromechanical Activation Time index and SDI Systolic Dysfunction Index calculated by the synchronized electrocardiography-acoustic device as treatment guidelines resulted in a higher survival rate compared to those treated based on traditional symptoms This research has also been published in the authoritative international cardiology journal JACC, receiving recognition in the international market System manufacturers can apply heart failure AI recognition technology for other value-added applications Leadtek states that cooperating system manufacturers can choose to build their own heart failure AI risk prediction engine, uploading their system's electrocardiogram and phonocardiogram data to Leadtek's heart failure AI risk prediction engine, which then returns risk prediction values for integration by system manufacturers cooperating manufacturers as a value-added application input Not just for clinical use, the heart failure AI risk prediction engine can also be extended for use at home or in the workplace Additionally, this system can be extended to other applications, including One, hospital outpatient screening Doctors can use the electrocardiogram and phonocardiogram recorder along with the heart failure AI risk prediction model to conduct a 10-second rapid test in outpatient and emergency departments to assess a patient's cardiac history and heart failure risk Two, discharge risk assessment Doctors can use the electrocardiogram and phonocardiogram recorder along with the heart failure AI risk prediction model to assess the heart failure risk during a patient's hospital stay The test data can serve as a pre-discharge risk assessment and prognostic indicator Three, continuous home care Patients can use the electrocardiogram and phonocardiogram recorder, wearable electrocardiogram recorder, and transmit through a home transmission box gateway to measure electrocardiogram and phonocardiogram signals at home and upload them to the amor health cloud platform for heart failure AI risk prediction analysis Patients can manage their health autonomously via an APP, reviewing historical physiological trends disease management nurses can manage member health through the health management backend Web Four, home rehabilitation training Patients can wear a health bracelet to monitor activity, fatigue, circulation, and sleep, autonomously managing their health through the mobile APP and observing the risk of heart failure, engaging in exercise and rehabilitation training to aid in swift recovery The heart failure AI recognition technology system can also be extended to employee home care applications Additionally, in factories or offices, this system can also achieve employee health management goals, with applications including One, workplace safety units Provide employees with wearable electrocardiogram recorders before they start work duties Two, physiological monitoring for business executors While executing business duties or training, employees wear wearable electrocardiogram recorders for fatigue warnings, signaling whether physiological conditions allow continued execution of tasks Task segments can use data transmission boxes or apps to upload physiological monitoring information to the health management platform, assessing the heart failure risk for operations staff, with test data serving as an indicator for enterprise resource human units and public safety Three, workplace physiological monitoring center care The workplace physiological monitoring center can inspect and record employees' historicalphysiological trends through the health cloud platform Four, workplace nursing units Nursing units receiving instructions from the physiological monitoring center can provide health management advice based on employees' physiological trends nursing centers can manage employee health through the health management backend Web Five, employees can wear health bracelets to monitor activity, fatigue, circulation, and sleep, autonomously managing their health and observing the risk of heart failure through the mobile APP, engaging in exercise and rehabilitation training to aid in rapid recovery Workplace application of heart failure cloud care and big data center diagram「Translated content is generated by ChatGPT and is for reference only Translation date:2024-05-19」