-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
ec3f14d
commit 97b5be4
Showing
1 changed file
with
85 additions
and
1 deletion.
There are no files selected for viewing
86 changes: 85 additions & 1 deletion
86
STAT660-01_f18-team-1_project2_data_analysis_by_JD.sas
100644 → 100755
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1,85 @@ | ||
|
||
*******************************************************************************; | ||
**************** 80-character banner for column width reference ***************; | ||
* (set window width to banner width to calibrate line length to 80 characters *; | ||
*******************************************************************************; | ||
|
||
* | ||
This file uses the following analytic dataset to address several research | ||
questions regarding loan amounts and statistics on loans | ||
Dataset Name: STAT660-01_f18-team-1_project2_data_preparation.sas | ||
See included file for dataset properties | ||
; | ||
|
||
* environmental setup; | ||
|
||
* set relative file import path to current directory (using standard SAS trick); | ||
X "cd ""%substr(%sysget(SAS_EXECFILEPATH),1,%eval(%length(%sysget(SAS_EXECFILEPATH))-%length(%sysget(SAS_EXECFILENAME))))"""; | ||
|
||
|
||
* load external file that generates analytic datasets cde_2014_analytic_file, | ||
cde_2014_analytic_file_sort_frpm, and cde_2014_analytic_file_sort_sat; | ||
%include '.\STAT660-01_f18-team-1_project2_data_preparation.sas'; | ||
|
||
|
||
*******************************************************************************; | ||
* Research Question Analysis Starting Point; | ||
*******************************************************************************; | ||
|
||
title1 | ||
'Research Question:What are the top three members that had the highest annual income?' | ||
; | ||
|
||
title2 | ||
'Rationale: This should help identify three members that earn the most annually' | ||
; | ||
|
||
* | ||
Methodology: Use PROC PORT to sort the annual income in the combined dataset | ||
descendingly and use PROC PRINT to output the top 3 member id accordingly. | ||
Limitations: The highest top 3 annual income data are not visualized. | ||
Follow Up: We can add PROC SGPLOT statement to plot a bar graph, making the annual | ||
income differences more explicit. | ||
; | ||
|
||
|
||
|
||
title1 | ||
'Research Question: What is the average loan amount for each state?' | ||
; | ||
|
||
title2 | ||
'Rationale: This shows the lending situation from each state.' | ||
; | ||
|
||
* | ||
Methodology: Use the PROC MEANS statement to compute the mean loan amount. | ||
Limitations: We cannot know the distribution of the loan amount for each state. | ||
Follow Up: Add min, median, and max in the PROC MEANS statement to compute the | ||
five-number summaries. | ||
; | ||
|
||
|
||
|
||
title1 | ||
'Research Question: What is the purpose of the highest loan amount?' | ||
; | ||
|
||
title2 | ||
'Rationale: This would help identify what is the use of the largest amount of money borrowed.' | ||
; | ||
|
||
* | ||
Methodology: Use PROC SORT statement to sort the loan amount descendingly and find | ||
out the purpose of it. | ||
Limitations: The currency symbol for the loan amount is not clear. | ||
Follow Up: Use PROC FORMAT to add dollar sign in the data output. | ||
; | ||
|
||
|