科研技能库/学术评估
学术写作
低风险

学术评估

对论文、研究提案、文献综述、方法部分、证据质量、引用支持以及科研写作反馈进行结构化学术工作评估。

文件预览

1 个文件
SKILL.md
4.8 KB · 可预览
---
name: scholar-evaluation
description: Structured scholarly-work evaluation for papers, proposals, literature reviews, methods sections, evidence quality, citation support, and research-writing feedback.
origin: community
---

# Scholar Evaluation

Use this skill to evaluate academic or scientific work with a repeatable rubric.

## When to Use

- Reviewing a research paper, proposal, thesis chapter, or literature review.
- Checking whether claims are supported by cited evidence.
- Evaluating methodology, study design, analysis, or limitations.
- Comparing two or more papers for quality or relevance.
- Producing structured feedback for revision.

## Evaluation Scope

Start by identifying the artifact:

- empirical research paper
- theoretical paper
- technical report
- systematic or narrative literature review
- research proposal
- thesis or dissertation chapter
- conference abstract or short paper

Then choose scope:

- **comprehensive**: all rubric dimensions
- **targeted**: one or two dimensions, such as method or citations
- **comparative**: rank multiple works against the same rubric

## Rubric

Score each applicable dimension from 1 to 5:

- 5: excellent; clear, rigorous, and publication-ready
- 4: good; minor improvements needed
- 3: adequate; meaningful gaps but usable
- 2: weak; substantial revision needed
- 1: poor; major validity or clarity problems

Use `N/A` for dimensions that do not apply.

### 1. Problem and Research Question

- Is the problem clear and specific?
- Is the contribution meaningful?
- Are scope and assumptions explicit?
- Does the question match the claimed contribution?

### 2. Literature and Context

- Is relevant prior work covered?
- Does the work synthesize rather than merely list sources?
- Are gaps accurately identified?
- Are recent and foundational sources balanced?

### 3. Methodology

- Does the method answer the research question?
- Are design choices justified?
- Are variables, datasets, participants, or materials described clearly?
- Could another researcher reproduce the work?
- Are ethical and practical constraints acknowledged?

### 4. Data and Evidence

- Are data sources credible and appropriate?
- Is sample size or corpus coverage adequate?
- Are inclusion, exclusion, and preprocessing decisions documented?
- Are missing data and bias risks discussed?

### 5. Analysis

- Are statistical, qualitative, or computational methods appropriate?
- Are baselines and controls fair?
- Are uncertainty, sensitivity, or robustness checks included when needed?
- Are alternative explanations considered?

### 6. Results and Interpretation

- Are results clearly presented?
- Do claims stay within the evidence?
- Are figures, tables, and metrics understandable?
- Are negative or null results handled honestly?

### 7. Limitations and Threats to Validity

- Are limitations specific rather than generic?
- Are internal, external, construct, and conclusion-validity risks addressed?
- Does the paper distinguish speculation from demonstrated results?

### 8. Writing and Structure

- Is the argument easy to follow?
- Are sections organized around the research question?
- Are definitions and notation clear?
- Is the tone precise and scholarly?

### 9. Citations

- Do cited papers support the claims attached to them?
- Are primary sources used where possible?
- Are reviews labeled as reviews?
- Are preprints labeled as preprints?
- Are citation metadata and links correct?

## Review Process

1. Read the abstract, introduction, figures, and conclusion for claimed
   contribution.
2. Read methods and results for evidence quality.
3. Check the strongest claims against cited sources.
4. Score each applicable dimension.
5. Separate critical blockers from revision suggestions.
6. End with concrete next edits.

## Output Template

```markdown
# Scholar Evaluation: <Artifact>

## Overall Assessment

- Overall score: <1-5 or N/A>
- Confidence: <high | medium | low>
- Summary: <3-5 sentences>

## Dimension Scores

| Dimension | Score | Evidence | Revision priority |
| --- | ---: | --- | --- |
| Problem and question |  |  |  |
| Literature and context |  |  |  |
| Methodology |  |  |  |
| Data and evidence |  |  |  |
| Analysis |  |  |  |
| Results and interpretation |  |  |  |
| Limitations |  |  |  |
| Writing and structure |  |  |  |
| Citations |  |  |  |

## Critical Issues

## Recommended Revisions

## Evidence Checks Needed
```

## Pitfalls

- Do not use the score as a substitute for concrete feedback.
- Do not penalize a paper for omitting a dimension outside its scope.
- Do not treat citation count, venue, or author reputation as proof of quality.
- Do not accept unsupported claims just because they appear in the abstract.

SKILL.md

元数据
namescholar-evaluation
description对论文、研究提案、文献综述、方法部分、证据质量、引用支持以及科研写作反馈进行结构化学术工作评估。
origincommunity

学术评估

使用此技能对学术或科研工作进行可重复的评分评估。

使用场景

  • 审阅研究论文、提案、论文章节或文献综述。
  • 检查声明是否得到引用证据的支持。
  • 评估方法、研究设计、分析或局限性。
  • 比较两篇或多篇论文的质量或相关性。
  • 提供结构化反馈供修改使用。

评估范围

首先确定评估对象:

  • 实证研究论文
  • 理论论文
  • 技术报告
  • 系统性或叙述性文献综述
  • 研究提案
  • 论文或学位论文章节
  • 会议摘要或短论文

然后选择评估维度:

  • 全面评估:涵盖所有评分维度
  • 针对性评估:聚焦一两个维度,如方法或引用
  • 比较评估:使用相同评分标准对多个工作进行排序

评分标准

对每个适用维度从1到5进行评分:

  • 5:优秀;清晰、严谨、可发表
  • 4:良好;需少量修改
  • 3:尚可;有显著不足但仍可用
  • 2:较差;需要大幅修改
  • 1:差;存在严重的有效性或清晰度问题

对不适用的维度使用N/A。

1. 问题与研究疑问

  • 问题是否清晰具体?
  • 贡献是否有意义?
  • 范围和假设是否明确?
  • 研究问题是否与所声称的贡献匹配?

2. 文献与背景

  • 是否涵盖了相关先前工作?
  • 是否对文献进行了综合而非简单罗列?
  • 研究空白是否准确识别?
  • 近期与基础文献是否平衡?

3. 方法

  • 方法是否能回答研究问题?
  • 设计选择是否有理有据?
  • 变量、数据集、参与者或材料是否清楚描述?
  • 其他研究者能否重复该工作?
  • 是否承认了伦理和实际限制?

4. 数据与证据

  • 数据来源是否可信且合适?
  • 样本量或语料覆盖是否充分?
  • 是否记录了纳入、排除和预处理决策?
  • 是否讨论了数据缺失和偏倚风险?

5. 分析

  • 统计、定性或计算方法是否适当?
  • 基线和对照是否公平?
  • 必要时是否进行了不确定性、敏感性或稳健性检验?
  • 是否考虑了替代解释?

6. 结果与解读

  • 结果是否清晰呈现?
  • 陈述是否在证据范围内?
  • 图表和度量是否易于理解?
  • 是否诚实处理了负面或无效结果?

7. 局限性与效度威胁

  • 局限性是否具体而非泛泛?
  • 是否讨论了内部、外部、构思和结论效度风险?
  • 论文是否区分了推测与验证结果?

8. 写作与结构

  • 论点是否易于理解?
  • 各部分是否围绕研究问题组织?
  • 定义和符号是否清晰?
  • 语气是否精确且学术?

9. 引用

  • 所引论文是否支持其附带的声明?
  • 是否尽可能使用一手文献?
  • 综述文章是否被标注为综述?
  • 预印本是否被标注为预印本?
  • 引用元数据和链接是否正确?

评审流程

  1. 阅读摘要、引言、图表和结论,了解声称的贡献。
  2. 阅读方法和结果,评估证据质量。
  3. 对照引用来源检查最强声明。
  4. 对每个适用维度评分。
  5. 区分严重阻碍与修改建议。
  6. 最后提供具体的下一步编辑建议。

输出模板

markdown
# 学术评估:<评估对象>

## 总体评估

- 总分:<1-5 或 N/A>
- 置信度:<高 | 中 | 低>
- 摘要:<3-5 句>

## 维度分数

| 维度 | 分数 | 证据 | 修改优先级 |
| --- | ---: | --- | --- |
| 问题与疑问 |  |  |  |
| 文献与背景 |  |  |  |
| 方法 |  |  |  |
| 数据与证据 |  |  |  |
| 分析 |  |  |  |
| 结果与解读 |  |  |  |
| 局限性 |  |  |  |
| 写作与结构 |  |  |  |
| 引用 |  |  |  |

## 严重问题

## 建议修改

## 需要检查的证据

注意事项

  • 不要用分数替代具体反馈。
  • 不要因为论文省略了超出范围外的维度而扣分。
  • 不要将引用次数、发表平台或作者声誉视为质量的证明。
  • 不要仅仅因为摘要中出现的未经验证声明就接受它们。