首页 > 其他分享 >COMP5310 分析数据

COMP5310 分析数据

时间:2023-03-31 14:00:11浏览次数:47  
标签:分析 COMP5310 group technique chart their report 数据 your


COMP5310 Project Stage 2A
Summarise and Analyse the Data
Due: 11:59pm on 6th of April 2023 (Week 7)
Value: 10% of the unit This stage is usually done with the same group members as you worked with for Stage 1. However, if someone is currently in a group that is not in their timetabled lab, they will need to move groups to one in their timetabled lab. If this applies to you, please urgently email [email protected] to arrange moving to a different group.
DISPUTE RESOLUTION If, during the course of the assignment work, there is a dispute among group members that
need to inform the unit coordinator, [email protected]. Make sure that your email includes your group number and tutorial session, and is explicit about the difficulty. Also, make sure this email is copied to your tutor and all the members of the group (including anyone you are complaining about). We need to know about problems in time to help fix them and deal with non- mance until a few days before the work is due to complain that someone is not delivering on their tasks). If necessary, the unit
in a group by themselves (they will need to achieve all the outcomes on their own). This option is only available up until Monday March 27th, which is the last day with time to resolve the issue before the due date. For any group issues that arise after this time, you will need to try to resolve the problem on your own, and you will continue to be treated as a single group. If
the material required for the report, or their material is not of the agreed standard, you should still have the report show what that person did. Their section enough. In such case describes the circumstances. That way, we can consider how best to apply the marking scheme. Note that it is not expected or sensible for other members to do the work that someone failed to deliver.
TASKS There are TWO individual tasks and ONE group task. The tasks should be addressed in a report, identifying which group member answered which sub-task.
INDIVIDUAL TASKS: 1. [4 marks] Each group member should answer ONE of these two sub-tasks using a
different statistical technique. At least one person from the group must answer each sub-task, but more than one person can answer the same sub-task using a different statistical technique: a. Identify a statistical technique that might be appropriate for summarisation and analysis of your dataset. For that technique:
o Name and describe the technique.
o Outline the assumptions that are required for the technique to be valid.
o Describe to what extent the assumptions are true for your dataset.
Page 2 of 4
o Justify your choice of technique in the context of the business question. b. Identify a statistical technique that is clearly not appropriate for summarisation and analysis of your dataset. For that technique:
o Name and describe the technique.
o Outline the assumptions that are required for the technique to be valid.
o Describe what assumptions are violated in your dataset.
o Justify why this technique is not appropriate for your dataset.
o Propose whether the data can be transformed in a way that makes the assumptions true and justify whether this is appropriate or not in the context of your business question.
NOTE: When justifying your conclusions, consider for example whether the technique
requires too many assumptions that are only partially true, or might make your
conclusions too unreliable to apply in your business context. Also consider the cost of
making a Type I error, and the cost of a Type II error in your business context. 2. [2 marks] Each individual should create one chart that visualises some aspect of the dataset that informs your understanding of the data and research question. Describe what conclusions you draw from the chart, and what questions it raises that you could answer in Stage 2B.
GROUP TASK: 1. [4 marks] Answer the following questions as a group: a. Describe any exploratory analysis you have undertaken to refine your understanding of the data and research question, the strengths and limitations of the exploratory analysis you undertook compared to at least one alternative, and justification for the analysis you undertook. b. Propose an approach (a particular classifier model, hypothesis test, etc) that you might take to solving your research question in Stage 2B, and any limitations or strengths of the approach compared to at least one other approach and justify your choice of approach. c. Outline, at a high level, how you will validate the approach, the strengths and limitations of the validation techniques you chose compared to at least one alternative method and justify your choice of validation techniques.
WHAT TO SUBMIT There are TWO deliverables in this stage of the project, and both should be submitted by
ONE PERSON on behalf of the whole group. 1. A written report on your work, as a PDF document. There is a maximum length for the report of 1500 words for groups of 2 and 2000 words for groups of 3. The report should have a front page, that gives the group name and lists the members involved (giving their SID and unikey, not their name), and then the body of the report should include a section for each group member (the section should state the SID/unikey of the group member who did the work reported in this section), answering the questions from the sub-task they selected, and finally a section where the group provides the answers to the group questions. 2. The code and dataset that you used to produce the analysis and charts in your report.
Page 3 of 4
This should be submitted as a single zip or tar.gz file which contains a subfolder for each group member.
MARKING
The submitted code and data may be considered as evidence to check or clarify statements made in the report.
Note: you will not be penalized in marks if you explore a reasonable question about the domain,
by looking at appropriate relationships between some aspects, and then conclude that there is
no clear relationship revealed.
Individual Task 1:
[Flawed]: States the name of the technique and answers, with valid justifications, one bullet point in their sub-task.
[Pass]: States the name of the technique and answers, with valid justifications, two bullet points in their sub-task.
[Distinction]: States the name of the technique and answers, with valid justifications, three bullet points in their sub-task.
[Full marks]: States the name of the technique and answers, with valid justifications, all four of the bullet points in their sub-task.
Individual Task 2:
[Flawed]: A chart of some data attribute.
[Pass]: A chart of some data attribute, correctly documented encoding between data attributes and visual attributes in each chart.
[Distinction]: A chart of some data attribute, and correctly documented encoding and other decisions (such as style of chart, scale etc), and sensible justification of the choice of encoding in view of the effectiveness of different visual attributes.
[Full marks]: A chart of some data attribute, and correctly documented encoding and other decisions (such as style of chart, scale, etc), and sensible justification of the choice of encoding in view of the effectiveness of different visual attributes, as well as sensible conclusions from the chart/statement of the questions it raises for Project Stage 2B.
Group Task:
[Flawed]: An answer to ALL the group questions.
[Pass]: A well-reasoned answer to ALL the group questions, including a discussion of strengths and limitations.
[Distinction]: A well-reasoned answer to ALL the group questions, including a discussion of strengths and limitations in comparison to an alternative for each question respectively.
[Full marks]: A well-reasoned answer to ALL the group questions, including a discussion of strengths and limitations in comparison to an alternative, and a justification of your choice
Page 4 of 4
for each question respectively.
Penalties 10% of the overall mark will be deducted if your report is unnecessarily longwinded and does not address the marking criteria within the word limits.
Late Work As announced in the unit outline, late work (without approved special consideration or other arrangements) suffers a penalty of 5% of the maximum marks, for each calendar day after the due date. No late work will be accepted more than 10 calendar days after the due date

WX:codehelp mailto: [email protected]

标签:分析,COMP5310,group,technique,chart,their,report,数据,your
From: https://www.cnblogs.com/nytalt/p/17276061.html

相关文章

  • Qt音视频开发32-qmedia内核回调拿图片数据
    一、前言使用qmediaplayer来打开视频并播放,默认首选会采用QVideoWidget控件来展示,优点是不用自己来绘制,一切交给了QVideoWidget控件,这样可以做到极低的CPU占用,缺点也明显,就是无法拿到每一帧的图片,很多时候我们还需要主动拿到每一帧的图片来运算做人工智能,通过不断的截图虽然也能......
  • 数据库系统的三层架构
    1、传统的数据库访问程序:(1)数据库访问和数据处理放在一起实现(2)用户界面层直接调用数据访问实现(3)整个系统功能放在同一项目中实现  2、三层架构模式 三层架构:   (1)界面层(UI)为用户提供一种交互式操作界面。作用:根据用户的具体需求,为每个功能模块部署输......
  • Redis数据库高可用
    一、Redis高可用在web服务器中,高可用是指服务器可以正常访问的时问,衡量的标准是在多长时间内可以提供正常服务(99.9%、99.99%99.998等等)。但是在Redis语境中,高可用的含义似乎要宽泛一些,除了保证提供正常服务(如主从分离、快速容灾技术),还需要考虑数据容量的扩展、数据安全不会丢......
  • CloudCanal 落地 DB2 数据迁移同步功能
    简述Db2是一款具有悠久历史的关系型数据库,由IBM公司开发和维护,广泛应用于金融级业务场景。CloudCanal近期提供了Db2为源端的数据迁移同步功能,用户可以便利地将Db2中数据实时同步到其他数据库,实现数据更广泛、更实时的应用。功能介绍目标数据库和能力目标端数据源......
  • 数据库重构探讨系列(1)
    数据库重构探讨系列(1)基础 1、数据库重构分成6类:2、数据库味道与“代码味道”概念相似,代码味道是代码中出现常见问题,表明需要进行重构。数据库味道表明数据库需要重构。这些味道包括:(1)多用途的列如一个列被用于多种用途,就可能存在额外的代码来确保源数据以“正确的方式......
  • 接口自动化之测试数据动态生成并替换
    一、测试数据1.随机库random查看内置random方法,该方法自行学习,不再介绍。showprint([namefornameindir(random)ifcallable(getattr(random,name))])['Random','SystemRandom','_Sequence','_Set','_accumulate','_acos......
  • RestSharp组件Get请求带body的时候返回数据丢失问题
    postman的复制代码默认就是RestSharp。方便也好用,但是使用get请求并且带Body的时候要注意,返回的数据竟然会有丢失解决办法:stringRequestByGet(stringindex,stringaction,objectparamter){varapi=$"{ElasticsearchUrl}/{index}/{action}";HttpWebRequestre......
  • azure databricks使用external hive metastore跨工作区共享元数据
    为什么要使用externalhivemetastore可以跨workspace的共享元数据,不用每次创建workspace的时候都重复的把元数据重建一次。更好的元数据集中管理,Createonce,useeverywhere。为灾难恢复(DR)做好为准备,并降低复杂性。(PAAS一样会存在意外的,不要以为不会,所以DR是必须的)可以更好控......
  • 千万级数据量表如何快速添加索引/字段
    添加字段语句ALTERTABLEid_tADDtitle(255)DEFAULT''COMMENT'标题'AFTERid;问题线上的一张表如果数据量很大千万级,执行加字段加索隐操作就会锁表,这个过程可能需要很长时间甚至导致服务崩溃,那么这样操作就很有风险了。解决一1.创建一个临时的新表,首先复制旧表的结......
  • 技术贴,必看!谷歌云新的 BigQuery 版本:数据云的灵活性和可预测性
    【CloudAce是谷歌云全球战略合作伙伴,拥有300多名工程师,也是谷歌最高级别合作伙伴,多次获得GoogleCloud合作伙伴奖。作为谷歌托管服务商,我们提供谷歌云、谷歌地图、谷歌办公套件、谷歌云认证培训服务。】 在数据平台方面,组织需要灵活性、可预测的定价和最佳性价比。 ......