
Web Scraping & Data Analysis

Posted: 2024-10-24 12:00:46
Tags: Web, movie, Scraping, Analysis, data, figures, analysis, marks, Data

Assignment 1: Web Scraping & Data Analysis

Sep 31, 2024

In this assignment, you should work with data from The Movie Database (TMDb).

TMDb is a popular platform for movie enthusiasts, offering a vast collection of movies from all genres and regions. TMDb provides users with detailed information such as movie titles, release dates, cast, crew, genres, ratings, and more. It's a go-to source for finding information about both classic and upcoming films, as well as the latest in TV shows.

Everyone is interested in great movies, but with so many films released each year, how can we find the best ones? Scraping high-quality data from movie websites is crucial. In this project, we will utilize the skills we've learned with requests and regular expressions to scrape essential movie details from The Movie Database (TMDb) website, allowing us to build a comprehensive dataset for further analysis.

Task 1. You are required to scrape 200 movies from the website and save the result into a CSV file (your_name+id.csv) with the following properties:

• Title of Movie (5 marks)
• … (5 marks)

You are free to explore data with more properties if needed.
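The scrape-and-save flow of Task 1 might look roughly like the sketch below. It is only an illustration under stated assumptions: the HTML snippet, the regular expression, the column names (`id`, `title`, `release_date`) and the helper names (`fetch_html`, `parse_movies`, `write_csv`) are all hypothetical, and the markup is a made-up stand-in rather than TMDb's real page structure, which you would need to inspect yourself.

```python
import csv
import io
import re

# Hypothetical markup: a stand-in for one listing page, NOT TMDb's real HTML.
SAMPLE_HTML = """
<div class="card">
  <h2><a href="/movie/101" title="Example One">Example One</a></h2>
  <p class="date">Jan 05, 2024</p>
</div>
<div class="card">
  <h2><a href="/movie/102" title="Example Two">Example Two</a></h2>
  <p class="date">Feb 10, 2024</p>
</div>
"""

# One match per movie card; named groups become the CSV columns.
MOVIE_PATTERN = re.compile(
    r'<h2><a href="/movie/(?P<id>\d+)"[^>]*>(?P<title>[^<]+)</a></h2>\s*'
    r'<p class="date">(?P<release_date>[^<]+)</p>'
)


def fetch_html(url):
    """Download one page. requests is imported here so the parsing demo runs offline."""
    import requests

    response = requests.get(url, headers={"User-Agent": "Mozilla/5.0"}, timeout=10)
    response.raise_for_status()
    return response.text


def parse_movies(html):
    """Return one dict per movie found in the page source."""
    return [match.groupdict() for match in MOVIE_PATTERN.finditer(html)]


def write_csv(rows, file_obj):
    """Write the parsed rows with a header line."""
    writer = csv.DictWriter(file_obj, fieldnames=["id", "title", "release_date"])
    writer.writeheader()
    writer.writerows(rows)


# Parse the sample page and write it to an in-memory CSV.
movies = parse_movies(SAMPLE_HTML)
buffer = io.StringIO()
write_csv(movies, buffer)
print(buffer.getvalue())
```

For the real assignment, one would presumably call `fetch_html` in a loop over listing pages until 200 movies are collected, then pass an open file (for example `open("your_name+id.csv", "w", newline="", encoding="utf-8")`) to `write_csv` instead of the in-memory buffer.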

Task 2. You are required to do a data analysis on the data. What do you think is interesting about this data? Tell a story about some interesting thing you have discovered by looking at the data. (60 marks)

For example, which one is the best movie you might watch? Does the type of movie affect movie sales? Which category of movies sells the best?

Note: This is an open topic project. You are required to provide a novel topic and demonstrate your hypotheses (viewpoints) with data analysis and figure illustrations. The report and running code (web scraping + data analysis) should be submitted as a Jupyter Notebook file.
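A question such as "which category of movies sells the best?" could be approached by grouping the scraped rows by genre and comparing average sales. The sketch below uses only the standard library; the rows are invented toy values, and the column names `genre` and `revenue` as well as the helper `mean_revenue_by_genre` are assumptions, not fields the assignment prescribes.

```python
from collections import defaultdict
from statistics import mean

# Toy rows standing in for the scraped CSV; all values are invented.
rows = [
    {"title": "A", "genre": "Action", "revenue": 300.0},
    {"title": "B", "genre": "Action", "revenue": 100.0},
    {"title": "C", "genre": "Drama", "revenue": 50.0},
    {"title": "D", "genre": "Drama", "revenue": 70.0},
    {"title": "E", "genre": "Comedy", "revenue": 120.0},
]


def mean_revenue_by_genre(rows):
    """Group revenues by genre and return the mean for each genre."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["genre"]].append(row["revenue"])
    return {genre: mean(values) for genre, values in grouped.items()}


averages = mean_revenue_by_genre(rows)
best_genre = max(averages, key=averages.get)

# Print genres from highest to lowest average revenue.
for genre, avg in sorted(averages.items(), key=lambda item: item[1], reverse=True):
    print(f"{genre}: {avg:.1f}")
print("Best-selling genre on average:", best_genre)
```

With real data, the same grouping would normally be followed by a figure (for instance a bar chart of the averages) and a check of how many movies fall in each genre, since an average over one or two titles is weak evidence.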

Submission Checklist

Items (Yes/No):

• Jupyter Notebook code
• your_name+id.csv

Marking Guidelines

Marking Criteria

Idea (5 marks)

• Presents a novel idea.
• Clearly demonstrates your viewpoints.
• Demonstrates good understanding of the topic.

Discussion (30 marks)

• Provides convincing arguments for your viewpoints.
• Backs up arguments with appropriate data analysis results.
• Visualizes data analysis results using more than 5 figures.

Organization (20 marks)

• Use of figures to support ideas discussed in the report.
• The quality of the figures. These figures should be informative.
• Use of sub-titles and/or clear topic sentences.
• Use of multiple visualization methods (line, bar, pie chart, etc.).

Writing Style (5 marks)

• Concise writing style.
• Strong scientific writing without grammatical errors.

From: https://www.cnblogs.com/goodlunn/p/18498776
