Thank you for Subscribing to Business Management Review Weekly Brief
I agree We use cookies on this website to enhance your user experience. By clicking any link on this page you are giving your consent for us to set cookies. More info
Thank you for Subscribing to Business Management Review Weekly Brief
By
Business Management Review | Tuesday, August 01, 2023
IDS solves two major industry challenges: a lack of training data and constant format and layout evolution.
FREMONT, CA:"While this proved helpful, the latest breakthroughs in IDS take this capability to new heights by enabling users to input a sample document and generate various layouts, such as swapping columns in a table containing line items or shuffling sections of the document horizontally or vertically and therefore creating new training documents," said Dr. Tianhao Wu, CTO and co-founder of AYR. "This makes it possible to train AI and machine learning models to recognize and process a wide range of document layouts, which is crucial for handling the diverse and ever-changing documents that businesses deal with daily."
The 3.0 edition of AYR's innovative technology, the Intelligent Document Simulator, has been released. AYR is a Princeton-based AI business specializing in Intelligent Document Processing (IDP) and Intelligent Automation (IDS). The lack of training data for client use cases and the constantly changing forms and layouts of business documents are two of the biggest problems in the sector that IDS addresses.
Stay ahead of the industry with exclusive feature stories on the top companies, expert insights and the latest news delivered straight to your inbox. Subscribe today.
Due to the delicate nature of their documents, which often contain proprietary or individually identifying information, businesses often need help to offer training data for intelligent automation. Also, obtaining the data required to train Intelligent Document Processing systems may be difficult since technology teams may not have easy access to business papers. IDS gets around these issues by producing synthetic data that closely resembles the look and feel of actual business papers.
Users could choose randomly from user-provided dictionaries that could be automatically constructed or manually assembled in the first version of IDS, which was published in early 2022. Thanks to this, users could create as many sample documents as necessary to train their IDP models.
The most recent development from the AYR team takes one step further by enabling users to replace all words with equivalent fields or values, bringing the synthetic data even closer to actual documents. As opposed to the original iteration's dictionaries, AYR supports two methods for creating synthetic content: using its own language model to create similar phrases, words, or lines of text or using the popular GPT-3 engine to create similar content dynamically. Users can further enhance their data and document samples, much like in earlier IDS, by blurring, rotating, and making the source papers more difficult to see, simulating the difficulties businesses encounter in the real world. The augmented data is utilized to accelerate time to market and push the limits of machine learning algorithms.
"As evidenced by the market growth in IDP, there is rising demand for automated processing of documents, at greater speed and accuracy, despite limited availability of training data," said Anil Vijayan, Partner at Everest Group. "Innovations in synthetic data generation and data augmentation will help enterprises overcome training challenges and facilitate even greater adoption of IDP solutions across a wide variety of use cases."
Businesses now have the resources they need to get past the obstacle of a lack of training data and fully utilize the capabilities of IDP thanks to AYR's Intelligent Data Simulator. These latest releases show AYR's dedication to promoting innovation and expanding the capabilities of IDP technology.
More in News