NEWS

AI synthetic data is inaccurate? Visol, a company that overcomes with “motion capture”

2025-02-28

[By Lee Eunhan Lee, Digital Daily] One of the problems facing the artificial intelligence (AI) industry at home and abroad is the lack of training data. The more good data you train your AI model with, the better it will perform, but not all data is readily available for every situation. There are conflicts of interest over copyright or royalties, and sometimes the data you want is itself rare. Recently, some have suggested that ultra-high-performance AI models have already trained on so much data that in a few years, there could be a shortage of data that outstrips demand.


Synthetic data is one of the solutions to the problem. Generally, it refers to artificial data created using generative AI. For example, let’s say you’re running out of cat photo data. Let the AI model generate new cat photos with the cat data it has already trained on, and refine it to become synthetic data that can be used for training again. This has the advantage of getting the data you need quickly and at a fraction of the cost of actually collecting and processing the data.


Of course, there are downsides. The main problems are lack of sophistication and inaccuracy. AI’s lack of understanding of real-world objects, and the possibility of AI hallucinations, which can lead to inappropriate data for training, cannot be ruled out.


Visol is a company that has proven that motion capture systems can solve part of this problem. Established in 2000, Bisol has been in the field of image analysis and processing and video solutions since then, just as its mission statement combines “Visual+Solution”.


Based on its experience and assets, synthetic data is now a keyword that has emerged as Bisol’s unique competitive advantage in the synthetic data market and has brought new insights to the industry. We caught up with Myungkyun Baek, Business Development Team Leader at Bisol’s office in Gasan-dong, Seoul, to find out more.


The biggest advantage of motion-capture-based synthetic data is the “precision of the data,” says Back. Motion capture is the movement of a person wearing a special suit with reflective markers, which is recognized by a motion capture camera in space and 3D coordinates are collected, which can be used for precise 3D modeling and synthetic data generation.


Bisol has long operated its own in-house motion capture studio as part of its business. This is where you can create sophisticated 3D modeling data with dynamic movement, including people. They divide synthetic data into “static data,” which is data about the movement of stationary objects such as vehicles, and “dynamic data,” which is data about complex movements involving joints and other parts of the body, such as people. Visol’s focus is on dynamic data, which is difficult to handle with traditional generative methods.


As an example of a real-world customer story, Baek cited Company A, a company specializing in screen golf. At the time, Company A was a laggard in the market and needed to differentiate itself in the AI space to catch up to the leaders, but its screengolf training data was lacking. “We provided Company A with about 2 million synthetic golf posture shots. This included 60 3D characters of varying heights, body types, and more, three indoor and outdoor 3D environments, and a variety of camera positions.” “I know they were quite happy with the results,” he says.

Company A needed data on golf swing posture in different environments, both indoors and outdoors. Not only is this data scarce in the market, but the process of having a human coach to oversee the data is extremely time-consuming, costly, and unbalanced in quality. This is because different participants, coaches, and coaching styles can produce different data. There were dozens of scenarios that needed to be addressed.

Visol invited professional golfers and members of the public to perform the required scenario movements in its in-house motion capture studio and generated a large amount of sophisticated 3D modeling synthetic data based on the collected mistakes.
“There are so many variables: people who are right-handed but batting left, people who have poor posture to begin with, different body types, different camera angles, different perspectives,” says Baek. “On top of that, the data needs to be able to clearly identify things like ‘hidden shoulder positions’. The traditional approach is to label the photo data with a gut feeling for the position of an invisible arm or hand. The problem is that this naturally leads to inaccuracies in the actual movement process, which is effectively solved by using 3D modeling data.”

Since then, Visol has been researching the possibility of expanding the synthetic data business in various fields. Another use case that utilizes motion capture is the “Taekwondo Poomsae Judging” solution. This could be useful for AI systems to measure the accuracy of taekwondo moves and improve training efficiency. If your data is “behavioral” in nature, the possibilities for scaling are endless. In particular, all data produced by Visol is free from copyright issues, which is another competitive advantage in the AI market, where data provenance is a recent issue.
Visol is also continuing to research synthetic data for special environments. Based on the company’s long-term experience in implementing 3D modeling environments and building dynamic synthetic data, the main field of research is the creation of synthetic data assuming special situations such as “disaster” and “war. In particular, the military sector is an area where the demand for AI simulation and AI training is increasing due to the trend of global military modernization, but it is also an area where it is difficult to secure sufficient data. As with other disaster situations, Bisol plans to continue to expand its synthetic data business through proactive research and case studies in these areas.
However, he agreed that synthetic data is not a one-size-fits-all. “No training data can be the same as real-world data,” says Baek. However, because Visol uses numerically informed 3D modeled objects and high-quality motion data collected through a high-precision optical motion capture system, we are able to meet a wide range of AI performance improvement requirements in terms of accuracy, sophistication, variety, and specificity.” “Our internal machine learning (ML) fusion development team is also constantly validating the data we generate. If we can drive even a few percentage points of performance improvement with synthetic data, it will be worth it.”

As Baek says, a number of research papers have already confirmed that 3D modeling synthetic data is already showing tangible results in various areas such as game production, robotics-based product assembly, and animal movement observation and analysis.

What is the impact of synthetic data in terms of actual business and revenue? Visol currently calculates relevant revenue as synthetic data included in the supply of its AI solutions, or as direct supply of synthetic data. It is expected to generate about 2 billion to 3 billion won in revenue this year and plans to expand its business to 8 billion won next year.

“Although the number of synthetic data use cases in Korea is increasing, there is still a general perception that it is still insufficient as data. Fortunately, there is a movement to foster and encourage synthetic data at the government level,” said Dr. Baek. “However, it is time to think about how to utilize synthetic data in a more active and diverse way to strengthen competitiveness in the rapidly developing AI industry.”