Content of the editorial feedback
Anatomical entity recognition with a hierarchical framework augmented by external resources
PLOS ONE
Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit, but is not suitable for publication as it currently stands. Therefore, my decision is "Major Revision."
We invite you to submit a revised version of the manuscript that addresses all of the concerns raised by the two reviewers. It is critical that you specifically address the following issues: 1) Provide more details on your methodology and data sources (possibly with examples), so that the reviewers can better evaluate the summary results provided in the tables; 2) Describe precisely what will be publicly available; 3) Thoroughly edit your revised manuscript before submission. Please note that PLOS ONE does not provide copy editing.
We encourage you to submit your revision within forty-five days of the date of this decision.
When your files are ready, please submit your revision by logging on to and following the Submissions Needing Revision link. Do not submit a revised manuscript as a new submission. Before uploading, you should proofread your manuscript very closely for mistakes and grammatical errors. Should your manuscript be accepted for publication, you may not have another chance to make corrections as we do not offer pre-publication proofs.
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.
In addition, when submitting your revision please include the following items:
∙ A rebuttal letter that responds to each point brought up by the academic editor and reviewer(s). This letter should be uploaded as a 'Response to Reviewers' file.
∙ A clean revised manuscript as your 'Manuscript' file.
∙ A marked-up copy of the changes made from the previous article file as a 'Revised Manuscript with Track Changes' file. This can be done using 'track changes' in programs such as MS Word and/or highlighting any changes in the new document.
For more information on how to upload your revised submission, see our video:
http://blogs.plos.org/everyone/2011/05/10/how-to-submit-your-revised-manuscript/
If you choose not to submit a revision, please notify us.
Yours sincerely,
Ramin Homayouni, Ph.D.
Academic Editor
PLOS ONE
Journal requirements:
When submitting your revision, we need you to address these additional requirements:
1. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold your manuscript until you get in touch with us with the accession numbers or DOIs necessary to access your data. If you wish to make changes to your data availability statement, please describe these changes in your cover letter and we will make them on your behalf.
Reviewers' comments:
Reviewer's Responses to Questions
Comments to the Author
1. Is the manuscript technically sound, and do the data support the conclusions?
The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.
Reviewer #1: Partly
Reviewer #2: Yes
2. Has the statistical analysis been performed appropriately and rigorously?
Reviewer #1: No
Reviewer #2: Yes
3. Does the manuscript adhere to the PLOS Data Policy?
Authors must follow the PLOS Data Policy, which requires authors to make all data underlying the findings described in their manuscript fully available without restriction. Please refer to the author's Data Availability Statement in the manuscript. All data and related metadata must be deposited in an appropriate public repository, unless already provided as part of the submitted article or supporting information. If there are restrictions on the ability of authors to publicly share data (e.g., privacy or use of data from a third party), these reasons must be specified.
Reviewer #1: Yes
Reviewer #2: No
4. Is the manuscript presented in an intelligible fashion and written in standard English?
PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.
Reviewer #1: No
Reviewer #2: Yes
5. Review Comments to the Author
Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)
Reviewer #1: This paper presents an interesting hierarchical framework to recognize anatomical entities, which is important in the healthcare domain. The authors also highlight the importance and the challenges of this task. I summarize my comments and suggestions as follows:
1) The features for the sequence labeling problem under CRF are comprehensive and acceptable. The authors include baseline natural language features, semantic features from external knowledge sources (Wikipedia and WordNet), co-reference features, and dictionary matching.
2) The authors conducted relatively comprehensive experiments to show the contribution of each individual feature, and of combinations of features, to the overall precision and recall.
3) The problem introduction and the annotation are good as well.
However, some major points need to be fixed:
1) The writing of this paper is really poor. None of the table references is correct, and grammatical errors appear in almost every paragraph. It is very, very difficult to read; it took me hundreds of hours to understand what the authors are trying to deliver. Let me just show some examples from the abstract: a) The first sentence is not a complete sentence. "To develop....in medical records."
b) "They infer relevant anatomical...in the record but also by other diverse..." ==> "They infer relevant anatomical entities based on both explicit anatomical expressions in the record and other diverse... "
c) "The hierarchical framework was demonstrated..." ==> "The hierarchical framework was demonstrated...in F1 comparing to ???"
There are many others in the paper!
2) For the annotation, the authors used A3 to check (A1, A2) and then obtained the agreement coefficient. Why not compute A3->(A1, A2), A1->(A2, A3), and A2->(A1, A3), and then take the average coefficient (a minimal sketch of such averaging is shown after these major points)? What happens if there is an annotation conflict, meaning that all three annotators disagree? In addition, the authors admit that their gold standard is not perfect; why, then, is it still used for the evaluations?
3) From the experimental results, CF seems to make the smallest contribution to precision in Table 5 and Table 8, so why does adding CF produce such a large increase in Tables 6 and 9? I do not believe this result. Can you give some explanation?
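Regarding point 2 above, a minimal sketch of the suggested averaged pairwise agreement could look as follows. The annotator labels are invented, and Cohen's kappa is used only as one plausible choice of coefficient; the manuscript's actual agreement measure is not specified in this letter.

```python
from itertools import combinations
from sklearn.metrics import cohen_kappa_score

# Hypothetical token-level BIO labels from the three annotators A1, A2, A3.
annotations = {
    "A1": ["B", "I", "O", "O", "B", "O"],
    "A2": ["B", "I", "O", "B", "B", "O"],
    "A3": ["B", "O", "O", "O", "B", "O"],
}

# Agreement for every annotator pair, then the average over all pairs,
# rather than checking (A1, A2) against A3 only.
pairwise = {
    (a, b): cohen_kappa_score(annotations[a], annotations[b])
    for a, b in combinations(annotations, 2)
}
for pair, kappa in pairwise.items():
    print(pair, round(kappa, 3))
print("average pairwise kappa:", round(sum(pairwise.values()) / len(pairwise), 3))
```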
In addition, one suggestion: it would be great if the paper gave a formal definition of each concept and showed some real or toy examples in a figure. These would help readers grasp the main points.
Reviewer #2: The manuscript by Yan Xu et al. describes the construction of an anatomical entity recognition framework based on a machine learning algorithm. This framework can recognize not only explicit expressions of anatomical entities, but also implicit expressions such as diseases, clinical treatments, and clinical tests. The authors argue that recognizing these implicit expressions is important because they are abundant in clinical records and it is from them that medical experts infer the anatomical entities described in the documents.
The framework consists of three layers of entity recognizers, all of which are based on conditional random field (CRF) models. The first layer is the multi-class CRF recognizer developed for the 2009 and 2010 I2B2 challenges; this layer recognizes entities of three semantic classes: diseases, clinical treatments, and clinical tests. The other two recognizer layers are developed in this study. One (the second layer) is for explicit anatomical expressions and the other (the third layer) is for implicit expressions.
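For readers unfamiliar with cascaded recognizers, a minimal sketch of how such a three-layer pipeline could be wired is given below. The function names and the way the layer-1 predictions are passed down as features are illustrative assumptions, not the authors' implementation; the only concrete API assumed is a CRF object exposing a predict_single method, as in sklearn_crfsuite.

```python
from typing import Dict, List

Token = Dict[str, object]  # one feature dict per token

def run_layer(crf_model, sentence: List[Token]) -> List[str]:
    # Apply one trained CRF recognizer and return one BIO label per token.
    return crf_model.predict_single(sentence)

def recognize(sentence: List[Token], layer1, layer2, layer3) -> Dict[str, List[str]]:
    # Layer 1: the i2b2-style recognizer for diseases, treatments, and tests.
    clinical_labels = run_layer(layer1, sentence)
    # Expose the layer-1 predictions as extra features for the anatomical layers.
    for token, label in zip(sentence, clinical_labels):
        token["layer1_label"] = label
    # Layer 2: explicit anatomical expressions; Layer 3: implicit expressions.
    explicit_labels = run_layer(layer2, sentence)
    implicit_labels = run_layer(layer3, sentence)
    return {
        "clinical": clinical_labels,
        "explicit_anatomy": explicit_labels,
        "implicit_anatomy": implicit_labels,
    }
```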
For use in the training and testing of the CRF models, the authors carefully constructed an annotated corpus of 300 clinical records (i.e., the discharge summaries in this study). The resulting annotations include 16690 explicit anatomical entity tokens and 5564 implicit anatomical entity tokens.
The authors used the following features in constructing the CRF models and examined their relative impact on recognition performance in terms of precision, recall, and F-score: baseline features (a standard set of useful features for general named entity recognition tasks); ontological features DF1 and DF2 (based on some of the representative anatomical ontologies: UMLS, MeSH, RadLex, and BodyParts3D); coreference features; and world knowledge features WF1, WF2, WF3, and HF, which are based on a dictionary constructed, for the purpose of extracting implicit anatomical entities, from the terms in Wikipedia and WordNet whose definition sentences contain explicit anatomical entities. HF is referred to as a hierarchical feature.
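To make these feature groups concrete, one token's feature map might look like the sketch below. The feature names and values are hypothetical examples of the groups described above (baseline features, dictionary features DF1/DF2, a coreference feature CF, world-knowledge features WF1–WF3, and the hierarchical feature HF); they are not the authors' actual feature templates.

```python
# Hypothetical feature map for the token "hepatic" in a sentence such as
# "A hepatic function panel was ordered."
token_features = {
    # Baseline features: surface form, POS tag, affixes, orthography.
    "word.lower": "hepatic",
    "pos": "JJ",
    "suffix3": "tic",
    "is_capitalized": False,
    # Dictionary/ontological features (DF1, DF2): matches against
    # UMLS, MeSH, RadLex, and BodyParts3D entries.
    "DF1.in_anatomy_dict": True,
    "DF2.match_length": 1,
    # Coreference feature (CF): linked to an earlier anatomical mention?
    "CF.coref_with_anatomy": False,
    # World-knowledge features (WF1-WF3): derived from Wikipedia/WordNet
    # entries whose definition sentences contain explicit anatomical entities.
    "WF1.wiki_def_contains_anatomy": True,
    # Hierarchical feature (HF): output of the explicit-entity recognizer.
    "HF.explicit_layer_label": "O",
}
```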
This study is original and addresses an important task in the processing of medical documents in general. Their analytical approach seems to be sound by the standards of ordinary research in natural language processing. Therefore, this manuscript seems to warrant publication in PLOS ONE.
The main criticism I have is the lack of consideration of concrete instances of the anatomical dictionaries, clinical record corpora, annotations, and experimental results. The authors provided only several numerical tables of precision, recall, and F-score. All the main conclusions were drawn from observation of these numerical tables. Although I know that this style is common in NLP research papers, I believe that without an investigation of concrete instances, readers cannot evaluate the relative impact of the many factors that affect the final performance.
With only a little thought, one can list many factors that affect the final results: the selection of data sources for the construction of the anatomical dictionaries, the relative contribution of the (four) data sources to the performance, whether some particular anatomical term in the four dictionaries has a significant effect on the performance, the total size of the anatomical dictionaries, the semantic types of terms included in the anatomical dictionaries, the type of clinical records, the total number of clinical records and sentences annotated by the experts, the target semantic types, the choice of machine learning algorithm, and the selection of features for the CRF models, as well as many other factors. However, observation of the series of numerical tables yields only limited information about the impact of these factors and about which entities can and cannot be recognized under the proposed framework.
Therefore, at the very least, the authors should provide part of the list of 16690 "explicit anatomical entity tokens" and 5564 "implicit anatomical entity tokens", with their numbers of occurrences in the corpus, because these define the problem that this manuscript is addressing.
In addition, the authors should discuss which terms in the anatomical dictionary match the annotated tokens and/or the results of the Begin/Inside/Outside (BIO) labeling by the CRF model. Some explanation of the relative impact of the framework components should then be provided based on concrete instances of these matching results.
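As one concrete illustration of the instance-level reporting requested here, a BIO-labeled fragment could be presented as follows. The sentence, labels, and implied dictionary match are invented for illustration and are not taken from the authors' corpus.

```python
# Hypothetical BIO labeling of a clinical sentence fragment, where
# "left lower lobe" is the explicit anatomical entity span.
tokens = ["CT", "of", "the", "left", "lower", "lobe", "showed", "consolidation"]
labels = ["O",  "O",  "O",   "B",    "I",     "I",    "O",      "O"]

# Pairing tokens with labels makes it easy to report which dictionary
# entries (e.g. "lower lobe") actually matched the annotated span.
for token, label in zip(tokens, labels):
    print(f"{token}\t{label}")
```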
A second criticism concerns the reproducibility of this study. The authors wrote at the end of the abstract that "The resources constructed for this research will be made publicly available," but since the resources needed to reproduce this study are not provided at this time, I could not evaluate whether the results can be reproduced using the resources the authors say will eventually be provided. I know that the authors have made great contributions to the NLP research field, not only by introducing novel concepts but also by providing many useful resources, including software and annotated corpora, and so I believe the resources that will be made available to the public will be quite useful for NLP researchers. Nevertheless, it is quite important to meet the reproducibility criterion stated in the publication criteria of PLOS ONE ("described in sufficient detail for another researcher to reproduce the experiments described"), and to meet this criterion I expect that the authors will need to write additional paragraphs describing in sufficient detail how to reproduce the result tables. I believe that the results are largely determined by the content of the dictionaries and annotated corpora constructed by the authors; without these resources, it will be quite difficult for other researchers to reproduce exactly the results described in the tables.
Minor points
Page 8, lines 7–10
I do not understand the meaning of the numbers described in Table 4.
What is the denominator of "Coverage of explicit named entity"? Total number of annotated tokens in the corpus? Or number of unique tokens annotated? In typical cases, rather simple anatomical terms such as "brain", "liver", and "blood" frequently appear in the corpus, and of course these are matched readily to the anatomical dictionaries.
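The two readings of this coverage figure differ only in the denominator; a toy computation of both, on invented data, is:

```python
from collections import Counter

# Invented toy data: every occurrence of an annotated explicit anatomical token,
# and a small anatomical dictionary.
annotated_tokens = ["brain", "liver", "brain", "blood", "hepatic", "brain"]
anatomy_dict = {"brain", "liver", "blood"}

occurrences = Counter(annotated_tokens)
covered = sum(n for tok, n in occurrences.items() if tok in anatomy_dict)

# Denominator 1: total number of annotated tokens in the corpus.
coverage_by_occurrence = covered / len(annotated_tokens)  # 5/6
# Denominator 2: number of unique annotated tokens.
coverage_by_type = len(set(annotated_tokens) & anatomy_dict) / len(set(annotated_tokens))  # 3/4

print(f"coverage over occurrences: {coverage_by_occurrence:.2f}")
print(f"coverage over unique tokens: {coverage_by_type:.2f}")
```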
Page 12, lines 7–13.
The table numbering in the main text is not consistent with the actual table numbers. (Table 4, ..., Table 9 in the main text should be Table 5, …, Table 10.)
Page 14, lines 3–5
Near the top of the DISCUSSION section, the authors wrote: "While the features based on the dictionary of anatomical entity expressions greatly improved the performance on explicit anatomical entities, they do not enhance the performance on explicit anatomical entities." The second occurrence of the word "explicit" should be "implicit".
6. If you would like your identity to be revealed to the authors, please include your name here (optional).
Your name and review will not be published with the manuscript.
Reviewer #1: (No Response)
Reviewer #2: (No Response)
[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]