• 2024-10-02【HITCON-Training】Lab 12 - SecretGarden
    学习于2024-10-0122:00:17星期二心得感想:这次真的把我整笑了,现在是10/2的晚上23点,我都不敢想象自己弄了多久(整整两天国庆的下午
  • 2024-09-30Training Records 2
    9.6CSP1*Blink题目描述小诗有一个不可重集\(S\),她记得\(S\)的元素数量\(n\)与\(\gcd⁡(S)+\operatorname{lcm}⁡(S)\)的值\(m\),但她已经忘记\(S\)由什么元素构成,她想知道有多少种符合条件的构成方案。由于答案过大,你只需告诉她答案\(\bmodP\)的值。对于
  • 2024-09-30Training Records 3
    9.30CSP7Alink题目描述给定\(5\)个长度为\(n\)的整数序列\(A,B,C,D,E\),求\[\sum_{i=1}^n\sum_{j=1}^n\sum_{k=1}^n\sum_{l=1}^l\sum_{m=1}^nmed(A_i,B_j,C_k,D_l,E_m)\mod998244353\]其中,\(med(a,b,c,d,e)\)为\(a,b,c,d,e\)的中位数。枚举中位数,计算即可
  • 2024-09-292024 Autumn Training #1 DF (by hzy)
    D.咸鱼跑酷(解有限trick)大意:长度n跑道,每个点可以二选一道具(+or*一个正数),q个询问从初始分数u,从l跑到r,求最大分数(结果模P)。可以预处理\(mul_i\)和\(add_i\),每个点要么乘要么加的数,把点分为两类,可乘点与不可乘点,\(mul_i=1\)意味着\(i\)点不可乘只能加,决策固定,因此我们需
  • 2024-09-282024 Autumn Training #2 CG (by hzy)
    C.Black-WhiteCubicLattice(网络流)大意:三维空间\(n*m*l\)格点黑白染色,已有初始色,每个点有翻转的代价\(w\),要求以最小的代价构造\((1,1,1)\)为黑,\((n,m,l)\)为白,且不存在内白外黑的点对。禁止内白外黑,考虑最小割,每个点向内连边\(inf\),白点流出\(w\),黑点流入\(w\),则最
  • 2024-09-28Pruning Large Language Models with Semi-Structural Adaptive Sparse Training
    本文是LLM系列文章,针对《PruningLargeLanguageModelswithSemi-StructuralAdaptiveSparseTraining》的翻译。通过半结构化自适应稀疏训练修剪大型语言模型摘要1引言2相关工作3方法4实验5结论摘要大型语言模型(LLM)在各种复杂任务中的巨大成功在很
  • 2024-09-24AI6012: Machine Learning Methodologie Applications
    AI6012:MachineLearningMethodologies&pplicationsAssignment(25points)Importantnotes:tofinishthisassignment,youareallowedtolookuptextbooksorsearchmaterialsviaGoogleforreference.NOplagiarismfromclassmatesisallowed.Thesubm
  • 2024-09-23ECE598HZ: Advanced Topics in Machine Learning
    ECE598HZ:AdvancedTopicsinMachineLearningandFormalMethodsFall2024Homework1DueSep2311:59pmCTTypesetyoursolutionsusingLATEX,createasinglezip fileincludingyoursolutions(ina singlePDF file), your code, andinstructionstorun
  • 2024-09-22[CVPR2024]DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets
    在长尾数据集上,本文引入强增强(文中也称为OOD)实现对DeiT的知识蒸馏的改进,实现尾部类分类性能的提升。动机ViT相较于CNN缺少归纳偏置,如局部性(一个像素与周围的区域关系更紧密)、平移不变性(图像的主体在图像的任意位置都应该一样重要)。因此需要大型数据集进行预训练。长尾数据学习
  • 2024-09-19Meta Llama3 论文研读
    一、引言概述(Intro&Overview)Llama3是一系列基于Transformer结构的大型多语言模型,通过优化数据质量、训练规模和模型架构,旨在提升模型在各种语言理解任务中的表现。通过引入更优质的数据和更高效的训练方法,Llama3展示了在自然语言处理领域的巨大潜力。其创新点在于其综合了
  • 2024-09-18nn.Dropout()与nn.functional.dropout()的区别
    在PyTorch中,`Dropout`主要有两种常见的使用方式:1.**`torch.nn.Dropout`模块**:通常用于模型的层定义中。2.**`torch.nn.functional.dropout`(即`F.dropout`)**:通常用于在`forward`方法中直接调用。###1.**`torch.nn.Dropout`**这是PyTorch中的标准`Dropout`层,通
  • 2024-09-172017 ACM/ICPC Asia Regional Qingdao Online(SDKD 2024 Summer Training Contest J2)
    C-TheDominatorofStrings题意给定n个串,问是否有一个串包含其他所有串,有就输出这个串。思路如果有解,答案必定是最长串,一一比较即可。(没想到.find()就能过
  • 2024-09-102016 ACM/ICPC Asia Regional Qingdao Online(SDKD 2024 Summer Training Contest H2)
    A-ICountTwoThree题意给定n,求第一个\(\ge\)n的数k,且k=\(2^a3^b5^c7^d\)。思路考虑到样例很多,直接打表存入set省去数组排序操作,由于n$\le$1e9,所以只需要打到1e9后二分即可。(记得加上快读快写,T得饱饱的
  • 2024-09-07P4649 [IOI2007] training 训练路径
    P4649[IOI2007]training训练路径题意:原题地址给你一棵\(n\)个节点的树,上面还有\(m-(n-1)\)条非树边,每条非树边有一个代价\(c_i\),要求你删掉若干条非树边使得之后的这棵树满足不存在任意一个长度为偶数的简单环。保证每个节点度数\(\le10\)。trick:如果树上不存在偶环
  • 2024-09-07Distributed Training: DeepSpeed ZeRO 1/2/3 + Accelerate, Megatron-LM
    1IntroductionGithub:https://github.com/microsoft/DeepSpeedZeRO:MemoryOptimizationsTowardTrainingTrillionParameterModelsZeRO-Offload:DemocratizingBillion-ScaleModelTrainingZeRO-Infinity:BreakingtheGPUMemoryWallforExtremeScaleDee
  • 2024-09-05Yolov8-源码解析-十二-
    Yolov8源码解析(十二)comments:truedescription:LearntosimplifytheloggingofYOLOv8trainingwithCometML.Thisguidecoversinstallation,setup,real-timeinsights,andcustomlogging.keywords:YOLOv8,CometML,logging,machinelearning,training,mo
  • 2024-09-05Yolov8-源码解析-四-
    Yolov8源码解析(四)comments:truedescription:ExploretheCOCO-Posedatasetforadvancedposeestimation.Learnaboutdatasets,pretrainedmodels,metrics,andapplicationsfortrainingwithYOLO.keywords:COCO-Pose,poseestimation,dataset,keypoints,CO
  • 2024-09-05Yolov8-源码解析-十七-
    Yolov8源码解析(十七)comments:truedescription:HarnessthepowerofUltralyticsYOLOv8forreal-time,high-speedinferenceonvariousdatasources.Learnaboutpredictmode,keyfeatures,andpracticalapplications.keywords:Ultralytics,YOLOv8,modelpred
  • 2024-09-05Yolov8-源码解析-一-
    Yolov8源码解析(一)comments:truedescription:LearnhowtocontributetoUltralyticsYOLOopen-sourcerepositories.Followguidelinesforpullrequests,codeofconduct,andbugreporting.keywords:Ultralytics,YOLO,open-source,contribution,pullrequest
  • 2024-09-03Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection 论文初读
    AbstractInthispaper,wepresentanopen-setobjectdetector,calledGroundingDINO,bymarryingTransformer-baseddetectorDINOwithgroundedpre-training,whichcandetectarbitraryobjectswithhumaninputssuchascategorynamesorreferringexpre
  • 2024-09-02SDKD 2024 Summer Training Contest F2(The 13th Shandong ICPC Provincial Collegiate Programming Contes
    A-Orders题意每天能生产k个产品的工厂有n个订单,第i个订单是在a_i天交b_i个产品,问能否交付。思路订单按日期排序,记录剩下的商品.代码#define_CRT_SECURE_NO_WARNINGS#include<bits/stdc++.h>usingnamespacestd;#defineintlonglongconstintmxn=1e6+5
  • 2024-09-01Social Skill Training with Large Language Models
    本文是LLM系列文章,针对《SocialSkillTrainingwithLargeLanguageModels》的翻译。大型语言模型的社交技能训练摘要1引言2角色和模拟的LLM3APAM框架4安全部署愿景5技术挑战6评估7讨论8总结与展望摘要人们依靠解决冲突等社交技能进行有效沟通,
  • 2024-08-28AutoSynth: Learning to Generate 3D Training Datafor Object Point Cloud Registration 论文解读
    目录一、导言二、先导知识1、进化算法概述2、4pcs算法3、Super4PCS算法三、相关工作1、传统点云配准工作2、基于深度学习的点云配准3、生成训练数据集四、AutoSynth框架1、搜索空间2、进化算法3、代理任务模型五、实验 1、测试数据集2、BOP评估指标3、对比实
  • 2024-08-28SDKD 2024 Summer Training Contest E2补题
    SDKD2024SummerTrainingContestE2A-PaperWatering题意对x进行至多k次操作(平方或开方后向下取整),求可以得到多少不同的数。思路平方完一定不同,且平方完后一定能开方出整数,所以只用额外考虑开方后平方的情况。若开方再平方与原来不同,则答案加上当前变化数的次数,直到变
  • 2024-08-27GaLore Memory-Efficient LLM Training by Gradient Low-Rank Projection
    目录概符号说明GaLoreZhaoJ.,ZhangZ.,ChenB.,WangZ.,AnandkumarA.andTianY.GaLore:Memory-efficientllmtrainingbygradientlow-rankprojection.ICML,2024.概本文提出了一种优化器中高效的缓存策略.符号说明\(W_t\in\mathbb{R}^{m\timesn}\),参