Assignment 2 - R programming

This assignment contributes 60% to your final module mark. The following learning outcomes will be assessed:

 Knowledge

1. Critical appreciation of the policies and procedures to manage data and information securely, to manage risk in technology management                                     

2.      Critical analysis of the strategic challenges, risks, opportunities and practical applications afforded by cybersecurity and data science for organisations to enable effective business operation and ensure business continuity

 Skills

 3. The ability to critically assess the appropriate technologies, infrastructures, tools and techniques required to address practical problems and challenges for organisations in data science and cybersecurity

 4.Analysis and evaluation of the professional, legal and ethical requirements of secure big data management in business and industrial environments

 

Important Information

 You are required to submit your work within the bounds of the University Infringement of Assessment Regulations (see your Programme Guide).  Plagiarism, paraphrasing and downloading large amounts of information from external sources, will not be tolerated and will be dealt with severely.  Although you should make full use of any source material, which would normally be an occasional sentence and/or paragraph (referenced) followed by your own critical analysis/evaluation.  You will receive no marks for work that is not your own. Your work may be subject to checks for originality which can include use of an electronic plagiarism detection service.

The work must be entirely your own.  The safety of your assessments is your responsibility.  You must not permit another student access to your work. Where referencing is required, unless otherwise stated, the Harvard referencing system must be used (see your Programme Guide). Please ensure that you retain a duplicate of your assignment.  We are required to send samples of student work to the external examiners for moderation purposes.  It will also safeguard in the unlikely event of your work going astray.

Submission Date and Time

Monday 20th January

Submission Location

Via Canvas

 

Assessment  002 R Analysis and Report

 Scenario: Imagine that you are a consultant working on a client sponsored project.  The client has a number of shops in the Sunderland area.  Your client would like you to conduct an analysis on their sales data. They also intend to adopt internet-based selling UK-wide and require your advice on security and other data issues. They would also like to email customers with special offers. The client has provided data in the form of tables that link:

1.      Postcodes: linking customer IDs

2.      Purchases: linking customer IDs and products bought during each year

3.      Products: linking costs

4.      Personal information: IDs, bank details, emails, addresses

 

Part 1: Analysis of the Data (50 marks)

 The first part of the assessment involves the analysis of the client’s data. The client has specifically asked for the data analysis to answer a number of business questions which you will tackle in your analysis. These questions are presented below:

1.      Who is the most frequent customer(s)? (5 marks)

2.      Which is the least profitable year in terms of money? (10 marks)

3.      What is the most profitable product(s) in terms of money in 2016? (10 marks)

4.      What is the best postcode(s) in terms of money in 2016? (10 marks)

5.      What is the least popular product in terms of frequency of sales in 2016? (5 marks)

6.      Use of ggplot2 to provide 2-3, graphs or charts etc. (10 marks)

You will need to develop R code to support your analysis, use dplyr where possible to get the numeric answers. Regarding ggplot2, be careful as to what type of plot you use and how you use them as we have many records and want the charts to be readable. You should place the R code in appendix at back of the report (it will not add to word count), section each piece of code with # comments and screenshots of outputs. The data is available to download from Canvas.

 Part 2: Customer advice - Report (50 marks)

The second part will involve writing a report (maximum 3000 words, the answers to part one are included in this total) and should include the following sections:

·         Discussion of Cyber security, big data issues: you will advise the client on based on available data and internet business plans and anything else you think appropriate (25 marks)

·         Discussion of Legal, IRM, GDPR issues: you will advise the client on based on available data and internet business plans and anything else you think appropriate (25 marks)

Penalties: Do not go over word limit (other than ±10%) as loss of marks will occur

Output: Submit PDF electronic copy to CETM50 Canvas before the deadline.