How can text data be quickly and accurately processed?
Using natural language processing and deep learning to classify comments in MEPS
Challenge
The Medical Expenditure Panel Survey (MEPS), funded by the Agency for Healthcare Research and Quality (AHRQ), is a set of large-scale surveys of families and individuals, and their medical providers across the U.S.
More than 20,000 open-ended comments are entered by the interviewers each year into the MEPS computer-assisted personal interviewing (CAPI) system to clarify the respondents’ answers.
Then a group of human coders reviews each sentence, assigns it a topic label from 10 predefined classes, and uses the associated procedures to further process the data. Also, because MEPS is a panel study, there is a short time window, typically one week, to process the comments so that the data can get back to the field staff for dependent interviewing in the next wave. Processing this data is labor-intensive and time-consuming.
We harnessed artificial intelligence (AI) capabilities to make the process more timely and efficient.
Solution
We use natural language processing (NLP), machine learning (ML), and deep learning techniques to train a classification model that automatically labels the comments with one of the 10 predefined classes.
We then deploy the model as a RESTful API in production so that it can run in the backend of the system used by the human coders. The model suggests the top 3 classes for each sentence ranked by classification probability, which allows human coders to make a selection out of 3 rather than 10 when reviewing the comments.
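The top-3 ranking step can be illustrated with a minimal Python sketch. It assumes the model returns one raw score per class for each sentence. The label names, the `softmax` and `top_k_suggestions` helpers, and the example scores are all hypothetical and are not taken from the production system.

```python
import math

# Hypothetical label names; the source says only that there are 10 predefined classes.
CLASS_LABELS = [f"class_{i}" for i in range(1, 11)]


def softmax(scores):
    """Convert raw model scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_suggestions(scores, labels=CLASS_LABELS, k=3):
    """Return the k most probable (label, probability) pairs, highest first."""
    if len(scores) != len(labels):
        raise ValueError("expected one score per class label")
    probs = softmax(scores)
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Made-up scores for a single comment sentence, one per class (assumption).
example_scores = [0.2, 2.9, -1.0, 0.4, 1.7, -0.3, 0.0, 3.4, -2.1, 0.8]
suggestions = top_k_suggestions(example_scores)
```

In a setup like this, a coder would see only the three entries in `suggestions` and would fall back to the full list of 10 classes only when none of them fits.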
Results
The data tool has been in production for the past 2 data collection periods in 2020. In each round it processed more than 10,000 comments, and its top suggestion was correct more than 95% of the time. This produced an efficiency gain of about 5% and reduced the backlog to virtually zero.