首页 > 其他分享 >Statistics and Data Analysis for Bioinformatics

Statistics and Data Analysis for Bioinformatics

时间:2024-10-30 19:41:58浏览次数:4  
标签:Gout Statistics Analysis will course report SA Data your

Statistics and Data Analysis for Bioinformatics Assessment Information

Summative Assessment
The course will be assessed by in-course assessment consisting of 2 components: a MCQ quiz (20%) and a class report (80%).

MCQ test (20%) 22/10/2024

The multiple choice question (MCQ) test will take place online within Moodle. This will assess the all course content and intended learning outcomes. You will have the opportunity to undertake a practice MCQ test in preparation.

Report (80%): due 30/10/2024

Your written report aligns to the course ILOs, particularly “Present statistical and graphical results from analyses of bioinformatic data within a written scientific report”. Using the dataset provided as your data source answer a research question set out in the “Report assignment details” appendix below. You will be expected to demonstrate your learning of course materials, i.e. include data visualisations and statistics relevant to answer your research question. Your report should be of similar format to a scientific paper of 1000-1500 words (+33% allowance, excludes figures and references) and include the following sections:
Introduction: A brief introduction to the biological context of your research question, drawing clear aims and hypotheses. Utilise the scientific literature to justify your research question.
Methods: Your statistical approach (no need to detail how data was collected unless of particular use). Methods should balance being concise with sufficient information to repeat/understand analyses carried out.
Results: Likely the largest section of your report, where you present visual and statistical results to help answer your research question. You will gain practice at reporting results throughout the course (practicals/lectures/papers).
Discussion: Briefly place your findings in the wider 代写Statistics and Data Analysis for Bioinformatics biological context and critique the chosen dataset.
References: Consistent formatting
Appendix: Can be useful for supplementary information that is important but not key to answer research question, e.g. model diagnostics. 

You should also submit your annotated R script, in a .R format. Your R Script is not marked but can provide proof/support of your work.


Formative Assessment and Feedback
Formative individual feedback will be provided to students throughout each computer laboratory practical session (asking questions and checking practical answers is encouraged), drop-in sessions and the Q&A forum. Generic and individual written feedback will be provided for both in-course summative assessments.

Appendix: Report assignment details

Background

In this assignment you will explore a new dataset. This dataset is whole blood, from humans who are either healthy (Healthy), have Gout (Gout) or have Septic Arthritis (SA). 
In Gout build-up of uric acid crystals triggers an immune response in patients’ joints. The immune system mistakenly thinks it’s a bacterial infection. It’s extremely painful, and can damage joints over time, but is not harmful other than that.
In SA there is a real bacterial infection in the joint, which quickly spreads to the blood. This is fatal in a few days if not diagnosed and treated. 
Our issue is that Gout and SA present clinically in a very similar way, and the bodies’ reaction is also very similar. This makes it hard to diagnose one from another. We want to know if this similarity extends to the transcriptional level in blood. We might then be able to use blood to diagnose SA from Gout.

Dataset
Our dataset is RNA-seq. We have three groups – Healthy, Gout and SA, and 14 samples of each. We provide:
The table of expression values by Gene (row) and Sample (column)
A sample information sheet, listing important clinical information about each sample
An annotation file – linking gene ID to gene name.
Two differential files (log2fold, p, adjusted p), comparing Healthy to Gout and Healthy to SA. 

Task
Your task is provide some insight to our main research question (see background). But there are additional aims. 
Are our groups well matched clinically? I.e. what will a table of summary statistics for the clinical information show? What do p-values between the groups at each clinical measurement show?
What are the most significantly differential genes between HC and Gout and HC and SA? What distributions do they show when we plot them? Are the genes similar?
Are these genes affected by any of the clinical measurements, such as Age, Sex, Neutrophils, Monocytes?
Are there any genes that are significantly different between Gout and SA. If so, what are they? What do they look like when plotted?

Report
You are to provide a report of 1,000 – 1,500 words in length. It must include tables and plots you feel appropriate. The report should have a brief introduction, methods, a results section, and a discussion / conclusions. All analysis should be performed in R. You must include your R script with the report.

Hints and Tips
Your report should not (and shouldn’t) be exhaustive. Just do the best with the time that you have. Prioritise things that you think are important. We are not looking for an exact “right” answer. We are looking for how you approach the problem. 
Obviously, your report should demonstrate your understanding of the material covered in the course. You don’t have to use every test and method you have learnt. Just what you believe the most appropriate. 

 

标签:Gout,Statistics,Analysis,will,course,report,SA,Data,your
From: https://www.cnblogs.com/CSSE2310/p/18516482

相关文章

  • DevExpress WinForms中文教程:Data Grid - 如何创建未绑定列
    本教程将介绍:在设计时创建未绑定列在设计时为未绑定列指定表达式在运行时编辑表达式向代码中的未绑定列提供数据编辑未绑定列中的单元格值并保存更改P.S:DevExpressWinForms拥有180+组件和UI库,能为WindowsForms平台创建具有影响力的业务解决方案。DevExpressWinForms能......
  • System.Data.SqlClient老版本引发内存溢出的问题
    有一个朋友和我说他负责的项目有内存溢出的情况,内存一直居高不下达到了8G。生产环境碰到内存溢出的情况我还没有碰到过,这次有机会实践一下,准备试试,拿到了dump文件,压缩包300M,解压出来3G。该事件是发生在2023年,写这篇文件的是已经是2024年了,为什么会间隔这么久才写这篇文章,因为当时......
  • Data-Free,多目标域适应合并方案,简单又有效 | ECCV'24
    来源:晓飞的算法工程笔记公众号,转载请注明出处论文:Training-FreeModelMergingforMulti-targetDomainAdaptation论文地址:https://arxiv.org/abs/2407.13771论文代码:https://air-discover.github.io/ModelMerging创新点对域适应的场景解析模型中的模式连通性进......
  • 有没有类似DataV开源库
    DataV是一个为开发者提供大数据可视化展示解决方案的库,以便将数据通过更直观的方式呈现给用户。本文将深入探讨:一、DataV的核心功能与特点;二、ECharts的使用及其特性;三、AntVG2的功能概览及适用场景;四、Three.js在数据可视化中的应用;五、D3.js的强大工具和灵活性。通过对比这些......
  • 深度学习入门笔记——DataLoader的使用
    如何使用数据集DataSet?在介绍DataLoader之前,需要先了解数据集DataSet的使用。Pytorch中集成了很多已经处理好的数据集,在pytorch的torchvision、torchtext等模块有一些典型的数据集,可以通过配置来下载使用。以CIFAR10数据集为例,文档已经描述的很清晰了,其中要注意的就是transform......
  • CSCI235 – Database Systems
    CSCI235 – Database Systems2024 S4Implementation Task 2Dueon 7 November 2024ScopeTheImplementationofTask2 is relatedtothecontents oftopic on Indexing.This Implementation is due byThursday, 7 November 2024, 9:00 pm Singa......
  • 什么是Nand Flash的Data Strobe信号
    NANDFlash的DataStrobe信号是一种用于同步数据传输的信号,特别是在高速NAND闪存接口(如ONFI规范的DDR模式)中使用。它帮助确保主机和NAND闪存设备之间的高效且准确的数据交换。具体而言,DataStrobe信号(简称DQS)的功能如下:数据同步:DataStrobe信号用于协调数据......
  • GaussDB技术解读——GaussDB架构介绍之数据持久化存取层(DataNode)关键技术方案
    数据持久化存取层(DataNode)关键技术方案Datanode节点主要负责数据的持久化和快速写入、读取。数据持久化采用物理日志wal,事务提交wal刷盘,对外提供逻辑日志功能,反解析物理日志为SQL逻辑日志。图1datanode数据持久化Astore:存储格式为追加写优化设计,其多版本元组采用新、老版......
  • FCA-FineDataLink认证试题及答案
    Part.1:判断题(总分:18分得分:18)第1题对于取出的API数据,希望进一步进行数据处理后再同步至数据库,此时可以在数据转换中使用API输入,然后做数据处理后再同步至数据库。(得分:2分满分:2分)A.正确B.错误第2题在进行数据连接时,可能会因为产品未内置对应版本的数据库......
  • 新接口 易宝OA-ExecuteSqlForDataSet接口存在SQL注入漏洞
    0x01阅读须知        本文章仅供参考,此文所提供的信息只为网络安全人员对自己所负责的网站、服务器等(包括但不限于)进行检测或维护参考。本文章仅用于信息安全防御技术分享,因用于其他用途而产生不良后果,作者不承担任何法律责任,请严格遵循中华人民共和国相关法律法规,禁......