tracy7777777 (new member, novice rank)
[Help request]
Could someone check whether this paper is indexed in SCI? Does it have an accession number?
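Title: Context-Sensitive Spelling Correction of Consumer-Generated Content on Health Care. JMIR Med Inform, 2015, vol. 3, issue 3. I can find the electronic version and am about to request an indexing certificate, but I don't know whether the paper is indexed in SCI. Could someone please check for me? Thank you!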
Moderator (well-known writer rank)
Context-Sensitive Spelling Correction of Consumer-Generated Content on Health Care

Authors: Zhou, XF (Zhou, Xiaofang) [1,2]; Zheng, A (Zheng, An) [2]; Yin, JH (Yin, Jiaheng) [2,3]; Chen, RD (Chen, Rudan) [2]; Zhao, XY (Zhao, Xianyang) [2]; Xu, W (Xu, Wei) [2]; Cheng, WQ (Cheng, Wenqing) [2]; Xia, T (Xia, Tian) [2,4]; Lin, S (Lin, Simon) [5]
Journal: JMIR MEDICAL INFORMATICS
Volume: 3  Issue: 3  Pages: 2-11  Article number: e27
DOI: 10.2196/medinform.4211
Published: JUL-SEP 2015

Abstract

Background: Consumer-generated content, such as postings on social media websites, can serve as an ideal source of information for studying health care from a consumer's perspective. However, consumer-generated content on health care topics often contains spelling errors, which, if not corrected, will be obstacles for downstream computer-based text analysis.

Objective: In this study, we proposed a framework with a spelling correction system designed for consumer-generated content and a novel ontology-based evaluation system which was used to efficiently assess the correction quality. Additionally, we emphasized the importance of context sensitivity in the correction process, and demonstrated why correction methods designed for electronic medical records (EMRs) failed to perform well with consumer-generated content.

Methods: First, we developed our spelling correction system based on Google Spell Checker. The system processed postings acquired from MedHelp, a biomedical bulletin board system (BBS), and saved misspelled words (eg, sertaline) and corresponding corrected words (eg, sertraline) into two separate sets. Second, to reduce the number of words needing manual examination in the evaluation process, we respectively matched the words in the two sets with terms in two biomedical ontologies: RxNorm and Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT). The ratio of words which could be matched and appropriately corrected was used to evaluate the correction system's overall performance. Third, we categorized the misspelled words according to the types of spelling errors. Finally, we calculated the ratio of abbreviations in the postings, which remarkably differed between EMRs and consumer-generated content and could largely influence the overall performance of spelling checkers.

Results: An uncorrected word and the corresponding corrected word was called a spelling pair, and the two words in the spelling pair were its members. In our study, there were 271 spelling pairs detected, among which 58 (21.4%) pairs had one or two members matched in the selected ontologies. The ratio of appropriate correction in the 271 overall spelling errors was 85.2% (231/271). The ratio of that in the 58 spelling pairs was 86% (50/58), close to the overall ratio. We also found that linguistic errors took up 31.4% (85/271) of all errors detected, and only 0.98% (210/21,358) of words in the postings were abbreviations, which was much lower than the ratio in the EMRs (33.6%).

Conclusions: We conclude that our system can accurately correct spelling errors in consumer-generated content. Context sensitivity is indispensable in the correction process. Additionally, it can be confirmed that consumer-generated content differs from EMRs in that consumers seldom use abbreviations. Also, the evaluation method, taking advantage of biomedical ontology, can effectively estimate the accuracy of the correction system and reduce manual examination time.

Keywords
Author keywords: spelling correction system; context sensitive; consumer-generated content; biomedical ontology
KeyWords Plus: MEDICATION EXTRACTION; DISAMBIGUATION; INFORMATION; PATIENT; ERRORS

Author information
Corresponding author address: Xia, T (corresponding author), Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Internet Technol & Engn Res & Dev Ctr, 1037 Luoyu Rd, Nanyi 430074, Peoples R China.
Addresses:
[1] Wuhan Cent Hosp, Dept Ophthalmol, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Internet Technol & Engn Res & Dev Ctr, Wuhan 430074, Peoples R China
[3] Fudan Univ, Sch Life Sci, Dept Biostat & Computat Biol, Shanghai 200433, Peoples R China
[4] Northwestern Univ, Feinberg Sch Med, NUBIC, Chicago, IL 60611 USA
[5] Nationwide Childrens Hosp, Res Inst, Columbus, OH USA
Email address: tianxia@hust.edu.cn

Publisher
JMIR PUBLICATIONS, INC, 59 WINNERS CIRCLE, TORONTO, ON M4L 3Y7, CANADA

Categories / Classification
Research area: Medical Informatics
Web of Science category: Medical Informatics

Document information
Document type: Article
Language: English
Accession number: WOS:000359790200001
PubMed ID: 26232246
ISSN: 2291-9694

Other information
IDS number: CP3OR
Cited references in Web of Science Core Collection: 20
Times cited in Web of Science Core Collection: 0
