
EE5434 learning model

Posted: 2024-12-05 18:31:59

EE5434 final project

Data were available on Nov. 5 (see the Kaggle website).

Report and source code due: 11:59 PM, Dec. 6.

Full mark: 100 pts.

During the process, you can keep trying new machine learning models to boost the learning accuracy.

You are encouraged to form groups of two with your classmates so that each team member can contribute their expertise. If you prefer to do this project by yourself, you can get 5 bonus points.

Submission format: The report should be in PDF format. The source code should be in a notebook file (.ipynb), and you should also save it as an HTML file (.html). Thus, there are three files you need to upload to Canvas. Remember that you must not copy anyone else's code; doing so can lead to failure of this course.

Files and naming rules: If your team has two members, start the file name with G2; otherwise, start it with G1. For example, if the team members are Jackie Lee and Xuantian Chan, name the file G2-Lee-Chan.xxx. 5 pts will be deducted if the naming rule is not followed. In your report, please clearly list the group members.

How do we grade your report? We will consider the following factors.

  1. You will get 30% (the basic grade) if you correctly apply two learning models to our classification problem. The accuracy should be much better than random guessing. Your report is written in generally correct English and is easy to follow. Your report should include a clear explanation of your implementation details and a basic analysis of the results.
  2. Factors in grading:
     a. Applied/implemented and compared at least 2 different models. You show good sense in choosing appropriate models (such as some NLP-related models).
     b. For each model, a clear explanation of the feature encoding methods, model structure, etc. Carefully tuned multiple sets of parameters or feature engineering methods. Provided evidence of multiple methods to boost the performance. Considered performance metrics beyond accuracy (such as confusion matrix, recall, ROC, etc.). Carefully compared the performance of different methods/models/parameter sets. Presented the results using the most insightful means, such as tables and figures.
     c. Well-written reports that are easy to follow/read.
     d. Final ranking on Kaggle.

For each factor, we use a five-level scale: unsatisfactory (1), acceptable (2), satisfactory (3), good (4), excellent (5). The sum over the factors determines the grade. For example, student A got 4 "good" and 1 "acceptable" for a to e. Then A's total score is 4*4+2=18. The full mark for a to e is 25. So, A's percentage is 72%.

Note that if the final performance is very close (e.g. 0.65 vs 0.66), the corresponding submissions belong to the same group in the ranking.

Factors that can increase your grade:

  1. You used a new learning model/feature engineering method that was not taught in class. This requires some reading and a clear explanation of why you think this model fits this problem.
  2. Your model's performance is much better than others' because of a new or optimized method.
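The grading factors above ask for metrics beyond accuracy, such as a confusion matrix and recall. A minimal sketch of how these could be computed without any library is shown here; the labels `pos`/`neg` and the toy predictions are invented for illustration and are not taken from the project data.

```python
from collections import Counter

def confusion_matrix(y_true, y_pred, labels):
    """Rows are true labels, columns are predicted labels."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]

def accuracy(matrix):
    """Fraction of samples on the diagonal of the confusion matrix."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total if total else 0.0

def per_class_recall(matrix):
    """Recall per class: correct predictions / number of true samples."""
    recalls = []
    for i, row in enumerate(matrix):
        total = sum(row)
        recalls.append(row[i] / total if total else 0.0)
    return recalls

# Toy labels, invented for illustration only.
labels = ["pos", "neg"]
y_true = ["pos", "pos", "pos", "neg", "neg", "neg", "neg", "neg"]
y_pred = ["pos", "neg", "neg", "neg", "neg", "neg", "neg", "pos"]

cm = confusion_matrix(y_true, y_pred, labels)
print(cm)                    # [[1, 2], [1, 4]]
print(accuracy(cm))          # 0.625
print(per_class_recall(cm))  # [0.3333333333333333, 0.8]
```

In this toy case the accuracy of 0.625 hides the fact that only one in three `pos` samples is recovered, which is the kind of gap a report limited to accuracy would miss.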

The format of the report

  1. There is no page limit for the report. If you don't have much to report, keep it simple. Also, minimize language issues by proofreading.
  2. To make our grading more standardized, please use the following sections:
     a. Abstract. Summarize the report (what you did, what methods you used, and the conclusions) in fewer than 300 words.
     b. Data properties (exploratory data analysis). You should describe your understanding/analysis of the data properties.
     c. Methods/models. In this section, you should describe your implemented models. Provide key parameters. For example, what are the features? If you use kNN, what is k and how did you compute the distance? If you use an ANN, what is the architecture, etc.? You should separate the high-level description of the models from the tuning of hyperparameters.
     d. Experimental results. In this section, compare and summarize the results using appropriate tables/figures. Simply copying screenshots is acceptable but will lead to a low mark for sure. Instead, you should *summarize* your results. You can also compare the performance of your model under different hyperparameters.
     e. Conclusion and discussion. Discuss why your models perform well or poorly.
     f. Future work. Discuss what you could do if more time were given.
  3. For each model you tried, provide the code of the model with the best performance. In your report, you can detail the performance of this model with different parameters.
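The Methods/models section above asks, for kNN, what k is and how the distance is computed. The following is a minimal sketch that makes both choices explicit parameters so they are easy to report; the 2-D points and the labels `a`/`b` are made up for illustration, and Euclidean distance is only one possible choice.

```python
import math
from collections import Counter

def euclidean(a, b):
    """Straight-line distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_predict(train_X, train_y, query, k=3, distance=euclidean):
    """Majority vote among the k training points closest to `query`."""
    neighbours = sorted(
        zip(train_X, train_y),
        key=lambda pair: distance(pair[0], query),
    )
    votes = Counter(label for _, label in neighbours[:k])
    return votes.most_common(1)[0][0]

# Toy training set, invented for illustration only.
train_X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (1.0, 1.0), (0.9, 1.1), (1.1, 0.9)]
train_y = ["a", "a", "a", "b", "b", "b"]

# Report the prediction for several values of k, as the brief suggests.
for k in (1, 3, 5):
    print(k, knn_predict(train_X, train_y, (0.8, 0.8), k=k))
```

Passing a different `distance` function (for example a cosine-based one for text features) changes the model without touching the voting logic, which keeps the high-level description separate from the tuning.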

The code

The code should include:

  1. Preprocessing of the data
  2. Construction of the model
  3. Training
  4. Validation
  5. Testing
  6. Any other code that is necessary
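The steps above could be laid out in a notebook roughly as follows. This is only a sketch under stated assumptions: the task is treated as text classification, the model is a small naive Bayes classifier written from scratch, and the `TRAIN`, `VAL`, and `TEST` rows are invented toy data rather than the Kaggle dataset.

```python
import math
from collections import Counter, defaultdict

# Toy data invented for illustration; the real data comes from Kaggle.
TRAIN = [
    ("great movie loved it", "pos"), ("wonderful acting great plot", "pos"),
    ("loved the wonderful cast", "pos"), ("terrible movie hated it", "neg"),
    ("boring plot terrible acting", "neg"), ("hated the boring cast", "neg"),
]
VAL = [("great fun loved it", "pos"), ("terrible boring hated it", "neg")]
TEST = [("wonderful great cast", "pos"), ("boring terrible plot", "neg")]

def preprocess(text):
    """Step 1: lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()

class NaiveBayes:
    """Step 2: multinomial naive Bayes with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, docs, labels):
        """Step 3: training is just counting words per class."""
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, tokens):
        """Return the class with the highest log-probability score."""
        n = sum(self.class_counts.values())
        best, best_score = None, -math.inf
        for label, count in self.class_counts.items():
            total = sum(self.word_counts[label].values())
            denom = total + self.alpha * len(self.vocab)
            score = math.log(count / n)
            for w in tokens:
                score += math.log((self.word_counts[label][w] + self.alpha) / denom)
            if score > best_score:
                best, best_score = label, score
        return best

def train(rows, alpha):
    docs = [preprocess(text) for text, _ in rows]
    labels = [label for _, label in rows]
    return NaiveBayes(alpha).fit(docs, labels)

def evaluate(model, rows):
    """Accuracy of `model` on (text, label) rows."""
    hits = sum(model.predict(preprocess(text)) == label for text, label in rows)
    return hits / len(rows)

# Step 4: validation picks the smoothing strength on held-out data.
candidates = [0.1, 1.0, 10.0]
val_scores = {alpha: evaluate(train(TRAIN, alpha), VAL) for alpha in candidates}
best_alpha = max(val_scores, key=val_scores.get)

# Step 5: testing happens once, on data that played no part in tuning.
model = train(TRAIN, best_alpha)
test_acc = evaluate(model, TEST)
print(val_scores, best_alpha, test_acc)
```

Keeping the validation and test sets separate, as here, means the reported test score is not inflated by the hyperparameter search.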

 

From: https://www.cnblogs.com/BUS001/p/18588297
