Skip to content

Commit

Permalink
Update by JD
Browse files Browse the repository at this point in the history
  • Loading branch information
jduan10-stat697 committed Oct 28, 2018
1 parent ec3f14d commit 97b5be4
Showing 1 changed file with 85 additions and 1 deletion.
86 changes: 85 additions & 1 deletion STAT660-01_f18-team-1_project2_data_analysis_by_JD.sas
100644 → 100755
Original file line number Diff line number Diff line change
@@ -1 +1,85 @@

*******************************************************************************;
**************** 80-character banner for column width reference ***************;
* (set window width to banner width to calibrate line length to 80 characters *;
*******************************************************************************;

*
This file uses the following analytic dataset to address several research
questions regarding loan amounts and statistics on loans
Dataset Name: STAT660-01_f18-team-1_project2_data_preparation.sas
See included file for dataset properties
;

* environmental setup;

* set relative file import path to current directory (using standard SAS trick);
X "cd ""%substr(%sysget(SAS_EXECFILEPATH),1,%eval(%length(%sysget(SAS_EXECFILEPATH))-%length(%sysget(SAS_EXECFILENAME))))""";


* load external file that generates analytic datasets cde_2014_analytic_file,
cde_2014_analytic_file_sort_frpm, and cde_2014_analytic_file_sort_sat;
%include '.\STAT660-01_f18-team-1_project2_data_preparation.sas';


*******************************************************************************;
* Research Question Analysis Starting Point;
*******************************************************************************;

title1
'Research Question:What are the top three members that had the highest annual income?'
;

title2
'Rationale: This should help identify three members that earn the most annually'
;

*
Methodology: Use PROC PORT to sort the annual income in the combined dataset
descendingly and use PROC PRINT to output the top 3 member id accordingly.
Limitations: The highest top 3 annual income data are not visualized.
Follow Up: We can add PROC SGPLOT statement to plot a bar graph, making the annual
income differences more explicit.
;



title1
'Research Question: What is the average loan amount for each state?'
;

title2
'Rationale: This shows the lending situation from each state.'
;

*
Methodology: Use the PROC MEANS statement to compute the mean loan amount.
Limitations: We cannot know the distribution of the loan amount for each state.
Follow Up: Add min, median, and max in the PROC MEANS statement to compute the
five-number summaries.
;



title1
'Research Question: What is the purpose of the highest loan amount?'
;

title2
'Rationale: This would help identify what is the use of the largest amount of money borrowed.'
;

*
Methodology: Use PROC SORT statement to sort the loan amount descendingly and find
out the purpose of it.
Limitations: The currency symbol for the loan amount is not clear.
Follow Up: Use PROC FORMAT to add dollar sign in the data output.
;


0 comments on commit 97b5be4

Please sign in to comment.