| 3 | 1/1 | 返回列表 |
| 查看: 896 | 回復(fù): 2 | ||
tracy7777777新蟲 (初入文壇)
|
[求助]
求幫看看有沒有在sci檢索到 入藏號有嗎 已有1人參與
|
|
title: context-sensitive spelling correction of consumer-generated context on health care JMIR Med inform 2015 3卷 3期 電子版查的到,準(zhǔn)備去打檢索證明,不知道有沒有被sci收錄,請哪位大神幫忙看看,謝謝!! |
版主 (知名作家)
|
Context-Sensitive Spelling Correction of Consumer-Generated Content on Health Care 作者:Zhou, XF (Zhou, Xiaofang)[ 1,2 ] ; Zheng, A (Zheng, An)[ 2 ] ; Yin, JH (Yin, Jiaheng)[ 2,3 ] ; Chen, RD (Chen, Rudan)[ 2 ] ; Zhao, XY (Zhao, Xianyang)[ 2 ] ; Xu, W (Xu, Wei)[ 2 ] ; Cheng, WQ (Cheng, Wenqing)[ 2 ] ; Xia, T (Xia, Tian)[ 2,4 ] ; Lin, S (Lin, Simon)[ 5 ] 查看 ResearcherID 和 ORCID JMIR MEDICAL INFORMATICS 卷: 3 期: 3 頁: 2-11 文獻號: e27 DOI: 10.2196/medinform.4211 出版年: JUL-SEP 2015 摘要 Background: Consumer-generated content, such as postings on social media websites, can serve as an ideal source of information for studying health care from a consumer's perspective. However, consumer-generated content on health care topics often contains spelling errors, which, if not corrected, will be obstacles for downstream computer-based text analysis. Objective: In this study, we proposed a framework with a spelling correction system designed for consumer-generated content and a novel ontology-based evaluation system which was used to efficiently assess the correction quality. Additionally, we emphasized the importance of context sensitivity in the correction process, and demonstrated why correction methods designed for electronic medical records (EMRs) failed to perform well with consumer-generated content. Methods: First, we developed our spelling correction system based on Google Spell Checker. The system processed postings acquired from MedHelp, a biomedical bulletin board system (BBS), and saved misspelled words (eg, sertaline) and corresponding corrected words (eg, sertraline) into two separate sets. Second, to reduce the number of words needing manual examination in the evaluation process, we respectively matched the words in the two sets with terms in two biomedical ontologies: RxNorm and Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT). The ratio of words which could be matched and appropriately corrected was used to evaluate the correction system's overall performance. Third, we categorized the misspelled words according to the types of spelling errors. Finally, we calculated the ratio of abbreviations in the postings, which remarkably differed between EMRs and consumer-generated content and could largely influence the overall performance of spelling checkers. Results: An uncorrected word and the corresponding corrected word was called a spelling pair, and the two words in the spelling pair were its members. In our study, there were 271 spelling pairs detected, among which 58 (21.4%) pairs had one or two members matched in the selected ontologies. The ratio of appropriate correction in the 271 overall spelling errors was 85.2% (231/271). The ratio of that in the 58 spelling pairs was 86% (50/58), close to the overall ratio. We also found that linguistic errors took up 31.4% (85/271) of all errors detected, and only 0.98% (210/21,358) of words in the postings were abbreviations, which was much lower than the ratio in the EMRs (33.6%). Conclusions: We conclude that our system can accurately correct spelling errors in consumer-generated content. Context sensitivity is indispensable in the correction process. Additionally, it can be confirmed that consumer-generated content differs from EMRs in that consumers seldom use abbreviations. Also, the evaluation method, taking advantage of biomedical ontology, can effectively estimate the accuracy of the correction system and reduce manual examination time. 關(guān)鍵詞 作者關(guān)鍵詞:spelling correction system; context sensitive; consumer-generated content; biomedical ontology KeyWords Plus:MEDICATION EXTRACTION; DISAMBIGUATION; INFORMATION; PATIENT; ERRORS 作者信息 通訊作者地址: Xia, T (通訊作者) 顯示增強組織信息的名稱 Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Internet Technol & Engn Res & Dev Ctr, 1037 Luoyu Rd, Nanyi 430074, Peoples R China. 地址: [ 1 ] Wuhan Cent Hosp, Dept Ophthalmol, Wuhan, Peoples R China 顯示增強組織信息的名稱 [ 2 ] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Internet Technol & Engn Res & Dev Ctr, Wuhan 430074, Peoples R China 顯示增強組織信息的名稱 [ 3 ] Fudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China 顯示增強組織信息的名稱 [ 4 ] Northwestern Univ, Feinberg Sch Med, NUBIC, Chicago, IL 60611 USA 顯示增強組織信息的名稱 [ 5 ] Nationwide Childrens Hosp, Res Inst, Columbus, OH USA 電子郵件地址:tianxia@hust.edu.cn 出版商 JMIR PUBLICATIONS, INC, 59 WINNERS CIRCLE, TORONTO, ON M4L 3Y7, CANADA 類別 / 分類 研究方向:Medical Informatics Web of Science 類別:Medical Informatics 文獻信息 文獻類型:Article 語種:English 入藏號: WOS:000359790200001 PubMed ID: 26232246 ISSN: 2291-9694 其他信息 IDS 號: CP3OR Web of Science 核心合集中的 "引用的參考文獻": 20 Web of Science 核心合集中的 "被引頻次": 0 |

新蟲 (初入文壇)
| 3 | 1/1 | 返回列表 |
| 最具人氣熱帖推薦 [查看全部] | 作者 | 回/看 | 最后發(fā)表 | |
|---|---|---|---|---|
|
[考研] 一志愿重慶大學(xué)085700資源與環(huán)境,總分308求調(diào)劑 +4 | 墨墨漠 2026-03-23 | 4/200 |
|
|---|---|---|---|---|
|
[考研] 工科0856求調(diào)劑 +4 | 沐析汀汀 2026-03-21 | 4/200 |
|
|
[考研] 081700 調(diào)劑 267分 +4 | 迷人的哈哈 2026-03-23 | 4/200 |
|
|
[考研] 0854電子信息求調(diào)劑 +3 | α____ 2026-03-22 | 3/150 |
|
|
[考研] 287求調(diào)劑 +8 | 晨昏線與星海 2026-03-19 | 9/450 |
|
|
[考研] 一志愿中南化學(xué)(0703)總分337求調(diào)劑 +9 | niko- 2026-03-19 | 10/500 |
|
|
[考研] 考研調(diào)劑 +4 | 來好運來來來 2026-03-21 | 4/200 |
|
|
[考研] 285求調(diào)劑 +6 | ytter 2026-03-22 | 6/300 |
|
|
[考研] 278求調(diào)劑 +9 | 煙火先于春 2026-03-17 | 9/450 |
|
|
[考研] 一志愿重慶大學(xué)085700資源與環(huán)境總分308求調(diào)劑 +7 | 墨墨漠 2026-03-20 | 7/350 |
|
|
[考研] 0805材料320求調(diào)劑 +3 | 深海物語 2026-03-20 | 3/150 |
|
|
[考研] 330求調(diào)劑0854 +3 | assdll 2026-03-21 | 3/150 |
|
|
[考研] 機械專碩299求調(diào)劑至材料 +3 | kkcoco25 2026-03-16 | 4/200 |
|
|
[考研] 一志愿華中科技大學(xué),080502,354分求調(diào)劑 +5 | 守候夕陽CF 2026-03-18 | 5/250 |
|
|
[考研] 材料專碩英一數(shù)二306 +7 | z1z2z3879 2026-03-18 | 7/350 |
|
|
[考研] 0817 化學(xué)工程 299分求調(diào)劑 有科研經(jīng)歷 有二區(qū)文章 +22 | rare12345 2026-03-18 | 22/1100 |
|
|
[考研] 求調(diào)劑 +3 | eation27 2026-03-20 | 3/150 |
|
|
[考研] 298-一志愿中國農(nóng)業(yè)大學(xué)-求調(diào)劑 +9 | 手機用戶 2026-03-17 | 9/450 |
|
|
[考研] 收復(fù)試調(diào)劑生 +4 | 雨后秋荷 2026-03-18 | 4/200 |
|
|
[考研] 333求調(diào)劑 +3 | 文思客 2026-03-16 | 7/350 |
|