From d4129b9bb95e0e0a5425adbdb9a9bec668f31615 Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sat, 2 Nov 2024 15:42:53 +0900
Subject: [PATCH 01/10] docs #16: update README.md

---
 README.md | 57 +++++++++++++++++++++++++++++--------------------------
 1 file changed, 30 insertions(+), 27 deletions(-)
diff --git a/README.md b/README.md
index 619f1e1..b5528cf 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-# @@@대문사진 
+# @@@ 1. 대문사진 
 
 ### 2022 Bigcontest data analysis field
 
@@ -35,45 +35,50 @@ https://github.com/NongShiN/2024_bigcontest_muju_festival_shuttle_bus
 
 - We chose **Muju Firefly Festival** and came up with measures to revitalize the festival.
 
-- The Muju Firefly Festival marks the 28th anniversary of this year in Muju, Jeonbuk-do, South Korea which comes with local agricultural product experiences, cultural performances, and environmental education programs focusing on firefly observation. Through this, it is a festival that promotes the importance of nature conservation and ecological preservation and contributes to revitalizing the local economy.
+- The Muju Firefly Festival marks the 28th anniversary of this year in Muju, Jeonbuk-do, South Korea which comes with local agricultural product experiences, cultural performances, and environmental education programs focusing on firefly observation.
 
 - We analyzed the current status of the festival and presented problem definitions and solutions accordingly.
 
 ##  2. Description of the data set <a name="section2"></a>
-There are 2 types of data we used in this analysis. One is "od_yyyymmdd_1.csv" data (hereinafter, od data), which is OD data between administrative periods from 2023.9.1 to 2023.10.15. 
+### 2.1 Initial steps <a name="sec2p1"></a>
+
+This is "od_yyyymmdd_1.csv" data (hereinafter, od data), which is OD data between administrative periods from 2023.9.1 to 2023.10.15. 
+
+@@@ 2. od_df 사진 
 
 The other is "stay_yyyymmdd_1.csv" data (hereinafter, stay data), which is the national administrative unit residence population data from 2023.09.01 to 2023.10.15.
 
-### 2.1 Initial steps <a name="sec2p1"></a>
-@@@od_df 사진 
+@@@ 3. stay_df 사진
+
 
-@@@stay_df 사진
+### 2.2 Visitor Analysis <a name="sec2p2"></a>
+#### This is the result of analyzing the number of people who visited Muju during the festival by age group.
 
-![head](images/head.JPG)
+@@@ 4. 연령별 방문인원 분포 사진
 
-### 2.2 Descriptive statistics <a name="sec2p2"></a>
-Pandas **describe()** can provide a quick summary of the data set as outlined in the notebook. However, without looking at the data in more detail, we cannot yet state what we think a typical diner is. What I mean is, just because most of the diners are male, smokers, and eating dinner on Saturday when we consider one variable at a time, that doesn't mean that all of these conditions are met simultaneously. In the notebook I calculate the tip as a fraction of the total bill as I think it's a measure of tip size that we are more familiar with. That is also done in the https://devarea.com/ reference below, in Wes McKinney's book when he is using the Tips data set as an example, and in the *Case Study 1: Restaurant Tipping* report, also below. So it seems like a sensible step to take. The output of pandas **describe(include="all")** is shown below. Here, all columns of the DataFrame are included in the analysis.
+From this summary we can say that:
+1. The percentage of visitors under 10s is the highest, followed by those in their 40s and 30s.
+2. From this, it can be inferred that a large number of family visitors have visited, accounting for a total of 78%.
+3. Among the remaining age groups, the proportion of people in their 20s is the highest, and the proportion of the remaining age groups (10s, 50s, 60s, 70s, and 80s) is less than 5%.
 
-![describeAll](images/describeAll.JPG)
+#### This is the result of analyzing the number of people who stayed Muju during the festival by age group.
+
+@@@ 5. 연령별 거주인원 분포사진
 
 From this summary we can say that:
-1. The average tip (as a fraction of total bill) is about 16%.
-2. The 50th percentile is very similar to the mean, so the mean tip is a typical value in the data set.
-3. More males than females paid the bill, 157 of the 244 observations.
-4. More non-smokers than smokers paid the bill, 151 of the 244 observations.
-5. Most of the observations relate to Saturday, 87 of the 244.
-6. Most of the observations relate to dinner, 176 of the 244.
-7. Party size varied from 1 to 6, with the average size being 2.5.
+1. The percentage of staying people 40s is the highest, followed by those in their 30s, under 10s and 30s.
+2. In the od data, few elderly people were observed, but the stay data clearly shows the ratio of those in their 50s to those in their 60s.
+
+#### This is the result of distribution of festival visitors' residence.
 
-I used pandas **iloc** to identify the highest and lowest tip rates:
-- The highest tip rate from a male smoker at dinner on Sunday in party size of 2, who left a 71% tip.
-- The lowest tip rate was also left by a male smoker at dinner in a party size of 2, but on Saturday; 3.6%.
+@@@ 6. 방문객 고향 사진
 
-This is what a plot of tip versus total bill looks like. Here, data from each day is plotted in a different colour, but the same could also be done for any of the other categorical variables sex, smoker, and time.
+From this summary we can say that:
+1. It can be seen that many visitors to the festival came from Jeonbuk and Chungnam/Daejeon.
+2. The average proportion of outsiders in Korea's festivals is 50%. It can be seen that the proportion of outsiders in the Muju Firefly Festival is 88% very high.
 
-![tipVSbill](images/tipVSbill.png)
 
-### 2.3 Start looking at categories of diner <a name="sec2p3"></a>
+### 2.3 Movement Analysis <a name="sec2p3"></a>
 We can use Pandas **groupby()** to get more detailed information about tipping behaviour for each category of diner. We are concerned with the fractional tip. From this part of the notebook, we can conclude that:
 1. It seems that non-smokers, regardless of their sex, leave similar tips (about 16%).
 2. On the other hand, for smokers, females leave higher tips than males on average (18% versus 15%).
@@ -82,12 +87,10 @@ We can use Pandas **groupby()** to get more detailed information about tipping b
 5. The highest average tip (as a fraction of total bill) is left at lunch on Fridays.
 6. The lowest average tip (as a fraction of total bill) is left at dinner on Saturdays.
 
-### 2.4 Plots to summarize some statistics <a name="sec2p4"></a>
-The following plots summarize this information graphically. So far it looks like the best time to be waiter in this restaurant is at lunch on Fridays if one is interested in the highest fractional tip. The best type of diner to serve is a female smoker. At this point of the analysis, I am not yet sure how the day and time variables are related to sex and smoker ones.
 
-![barSmokerSex](images/barSmokerSex.png)
 
-![barDayTime](images/barDayTime.png)
+
+
 
 ##  3. Regression <a name="section3"></a>
 For this part of the assessment, we have been asked to analyse if there is a relationship between the total bill and the tip amount. The simplest relationship would be a linear one. That's reasonable when we consider that tips (especially in the US) are usually a fixed percentage of the total bill. A linear model looks like:

From 9515bb181671e1c92913cff37b42b8efa60fb68d Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sat, 2 Nov 2024 18:31:45 +0900
Subject: [PATCH 02/10] docs #16: update README.md

---
 README.md | 69 ++++++++++++++++++++++++++++++++++++++++---------------
 1 file changed, 50 insertions(+), 19 deletions(-)

diff --git a/README.md b/README.md
index b5528cf..668920f 100644
--- a/README.md
+++ b/README.md
@@ -79,36 +79,67 @@ From this summary we can say that:
 
 
 ### 2.3 Movement Analysis <a name="sec2p3"></a>
-We can use Pandas **groupby()** to get more detailed information about tipping behaviour for each category of diner. We are concerned with the fractional tip. From this part of the notebook, we can conclude that:
-1. It seems that non-smokers, regardless of their sex, leave similar tips (about 16%).
-2. On the other hand, for smokers, females leave higher tips than males on average (18% versus 15%).
-3. The most frequently-occurring party size is 2 (156 of the total), followed by 3 (38), and 4 (37). There are only a handful of observations related to party sizes of 1, 5, and 6.
-4. The data set only contains information about dinner on Saturday (87 out of 244) and Sunday (76). There is one dinner observation on Thursday, the rest are lunch (61). Friday has lunch and dinner recorded, but overall numbers are small (19 in total).
-5. The highest average tip (as a fraction of total bill) is left at lunch on Fridays.
-6. The lowest average tip (as a fraction of total bill) is left at dinner on Saturdays.
+#### This is the result of the distribution of travel distance to Muju by age group.
 
+@@@ 7. 연령대별 이동거리 차이 사진
+
+From this summary we can say that:
+1. Those under 10s and 30s and 40s visit from various distances, ranging from close to far away.
+2. 10s, 20s, 50s, and 60s usually visit at close range.
+
+#### This is the distribution of transportation used by festival visitors.
+
+@@@ 8. 방문객 이용 교통수단 사진
+
+From this summary we can say that:
+- With 39019 cases of car use, most visitors visited the festival by car.
 
 
 
+3. [Problem definitions and solutions](#section3)
+    1. [Hypothesis Setting and Cause Analysis](#sec3p1)
+    2. [Proposition of Shuttle Buses](#sec3p2)
+    3. [Shuttle Bus Timetable](#sec3p3)
+    4. [Shuttle Bus Route](#sec3p4)
+
+
+##  3. Problem definitions and solutions <a name="section3"></a>
+### 3.1 Hypothesis Setting and Cause Analysis <a name="sec3p1"></a>
+#### 3.1.1 Hypothesis Setting
+
+1.  The means of off-vehicle transportation are poor.
+2.  There are restrictions on participation according to accessibility by age group.
+
+#### 3.1.2 Cause Analysis
+#### The results of transportation and time required to travel from major cities to Muju.
+
+Departure city | Travel Route | Time Required |
+------------|---------------|-------|
+Seoul           | Seoul Station - KTX - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
+Jeonju     | Jeonju Express Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
+Daegu  | Daegu Station - Mugunghwa Train - Yeongdong Station - city Bus - Muju Public Bus Terminal | 2h 40m |
+Busan     | Busan Station - SRT - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 3h |
+Gwangju     | Gwangju Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 3h 20m | 
+
+From this summary we can say that:
+- From other cities to Muju festival sites, the travel route is complicated and the travel time is too long.
+
+#### This shows the contents of the festival by time and the last bus time from the festival site to each city
+
+@@@@ 사진9
+
+From this summary we can say that:
+1. Bus services are limited to certain areas and time zones.
+2. The bus schedule does not match the time of the festival program, so we cannot use it when we return home.
+
 
 
-##  3. Regression <a name="section3"></a>
-For this part of the assessment, we have been asked to analyse if there is a relationship between the total bill and the tip amount. The simplest relationship would be a linear one. That's reasonable when we consider that tips (especially in the US) are usually a fixed percentage of the total bill. A linear model looks like:
 
-**y = m x + c**
 
-where
-- y is the tip
-- x is the total bill
-- m is the slope of the line
-- c is the y intercept
 
-### 3.1 Regression in Seaborn <a name="sec3p1"></a>
-In the notebook we first use Seaborn to visualize any linear relationship between our two variables of interest using **regplot** and **lmplot**. This does not give us any fitting parameters such as the slope and intercept of the linear fit, or any metrics to assess the quality of the fit, but it's a good start. Here we plot the best straight lines through smoker and non-smoker data points, as found by Seaborn. We will look at these categories again later on in this section. For now we can say that the best straight lines through the data points have different slopes for smokers and non-smoker. The shaded regions represent the 95% confidence levels, and they don't even overlap in this plot. 
 
-![SeabornFit](images/lmplotSmoke.png)
 
-### 3.2 Simple linear regression using polyfit <a name="sec3p2"></a>
+### 3.2 Proposition of Shuttle Buse <a name="sec3p2"></a>
 We perform a simple linear regression analysis of the data as per the week 9 lectures for this module. **numpy.polyfit** can calculate the slope and intercept of the best fit line based on least squares fitting. It doesn't directly return a metric, so we must use **numpy.corrcoef** to evaluate the strength of the linear relationship between the total bill and tip amount. This function returns a matrix from which we can calculate the R<sup>2</sup> value as explained in the reference below about Pearson and Spearman Correlation in Python. The fitting parameters for our linear model are: 
 - slope = 0.105
 - intercept = 0.920

From c750ec92ddc93a35a04f7628c2d42aa5dd209ad6 Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 00:20:40 +0900
Subject: [PATCH 03/10] docs #16: update README.md

---
 README.md | 155 +++++++++++++++++++++++++++---------------------------
 1 file changed, 77 insertions(+), 78 deletions(-)

diff --git a/README.md b/README.md
index 668920f..6b84967 100644
--- a/README.md
+++ b/README.md
@@ -94,15 +94,6 @@ From this summary we can say that:
 From this summary we can say that:
 - With 39019 cases of car use, most visitors visited the festival by car.
 
-
-
-3. [Problem definitions and solutions](#section3)
-    1. [Hypothesis Setting and Cause Analysis](#sec3p1)
-    2. [Proposition of Shuttle Buses](#sec3p2)
-    3. [Shuttle Bus Timetable](#sec3p3)
-    4. [Shuttle Bus Route](#sec3p4)
-
-
 ##  3. Problem definitions and solutions <a name="section3"></a>
 ### 3.1 Hypothesis Setting and Cause Analysis <a name="sec3p1"></a>
 #### 3.1.1 Hypothesis Setting
@@ -111,20 +102,20 @@ From this summary we can say that:
 2.  There are restrictions on participation according to accessibility by age group.
 
 #### 3.1.2 Cause Analysis
-#### The results of transportation and time required to travel from major cities to Muju.
+#### 3.1.2.1 The results of transportation and time required to travel from major cities to Muju.
 
-Departure city | Travel Route | Time Required |
+Departure City | Travel Route | Time Required |
 ------------|---------------|-------|
 Seoul           | Seoul Station - KTX - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
 Jeonju     | Jeonju Express Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
-Daegu  | Daegu Station - Mugunghwa Train - Yeongdong Station - city Bus - Muju Public Bus Terminal | 2h 40m |
+Daegu  | Daegu Station - Mugunghwa Train - Yeongdong Station - City Bus - Muju Public Bus Terminal | 2h 40m |
 Busan     | Busan Station - SRT - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 3h |
 Gwangju     | Gwangju Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 3h 20m | 
 
 From this summary we can say that:
 - From other cities to Muju festival sites, the travel route is complicated and the travel time is too long.
 
-#### This shows the contents of the festival by time and the last bus time from the festival site to each city
+#### 3.1.2.2 This shows the contents of the festival by time and the last bus time from the festival site to each city
 
 @@@@ 사진9
 
@@ -132,71 +123,87 @@ From this summary we can say that:
 1. Bus services are limited to certain areas and time zones.
 2. The bus schedule does not match the time of the festival program, so we cannot use it when we return home.
 
+#### 3.1.2.3 There are restrictions on participation in festivals due to differences in accessibility by age groups.
+1. In the case of 20s, the vehicle possession is low, so the dependence on public transportation is high, but the participation rate of the festival is low due to the weak public transportation situation to the festival venue.
+2. In the case of people in their 50s, the degree of interest in the festival can be confirmed by looking at the distribution of the number of people staying, but participation restrictions are expected due to fatigue caused by long-distance travel.
 
+#### 3.1.2.4 Improvements to the 27th Muju Firefly Festival (last year)
+ Rank | Content |
+------------|---------------|
+1           | Transportation |
+2     | The variety of festival food |
+3  | Good things to buy / Local specialties |
+4     | Event tour information |
 
-
-
-
+From this survey we can say that:
+- Many participants can see that they are uncomfortable with the transportation of the festival.
 
 
 ### 3.2 Proposition of Shuttle Buse <a name="sec3p2"></a>
-We perform a simple linear regression analysis of the data as per the week 9 lectures for this module. **numpy.polyfit** can calculate the slope and intercept of the best fit line based on least squares fitting. It doesn't directly return a metric, so we must use **numpy.corrcoef** to evaluate the strength of the linear relationship between the total bill and tip amount. This function returns a matrix from which we can calculate the R<sup>2</sup> value as explained in the reference below about Pearson and Spearman Correlation in Python. The fitting parameters for our linear model are: 
-- slope = 0.105
-- intercept = 0.920
-- R<sup>2</sup> = 0.457
-
-So, a linear relationship does exist between the total bill and the tip amount, but in my opinion, it's not a very strong one. When dealing with scientific data in the past, I would be looking for much higher R<sup>2</sup> values. I'll discuss that more below. The best slope here corresponds to approximately a 10% tip, and note that the intercept is not zero; suggesting that the minimum tip is about $1. One can see lines of data points representing tip of $1, $2 and $3. This suggests that lots of diners round their tips to the nearest $. 
-
-![SimpleLinReg](images/LSQalldata.png)
-
-### 3.3 Regression with statsmodels <a name="sec3p3"></a>
-We then move on to using two packages, statsmodels and scikit-learn, to perform linear regression and return fitting parameters and metrics. statsmodels is a Python package for performing statistical analysis of data - we are interested in the OLS (Ordinary Least Squares) module for performing linear regression. OLS involves fitting a linear model with coefficients to minimize the residual sum of squares between the observed data points and the best fit: for each data point, square the difference between it and the best fit, and sum all of these residuals. We modify the model slightly to include a y intercept. The model returns a report containing statistical information, but for this project we are only interested in the slope, intercept, and value of R<sup>2</sup>.
-
-### 3.4 Regression with scikit-learn <a name="sec3p4"></a>
-Scikit-learn is a machine learning package which can also perform OLS fitting. Strictly speaking there is no need to perform regression with both packages, but I do it once in the notebook and then stick to scikit-learn.  This package is useful for making predictions using the data set, something we may get on to later. We use the scikit-learn LinearRegression model which performs OLS fitting.
-
-In regression, R<sup>2</sup> is the coefficient of determination, a measure of how close the data points are to the fitted regression line; or how much of the variation in the data is explained by the linear model. It ranges from 0 to 1, and in general, higher values of R<sup>2</sup> are better. However, as the minitab reference below discusses, that's not the full story. That reference states that in fields which try to predict human behaviour (the tips data set falls into this category), values of R<sup>2</sup> less that 0.5 are not unusual; we find R<sup>2</sup> = 0.457 on average. It's also important to take into account the appropriateness of the model when assessing R<sup>2</sup>. Another model (perhaps a high-order polynomial fit) may produce a better value of R<sup>2</sup> but wouldn't be a sensible way to model how tip amount varies with total bill. 
-
-To conclude this part of the analysis: the tip does depend linearly on the total bill in this data set. The slope of the best fit line is 0.105, the y intercept is 0.920, and R<sup>2</sup> is 0.457.
-
-### 3.5 Linear regression on various subsets of the data <a name="sec3p5"></a>
-The results of regression on all of the data, and on subsets of it, are presented in the table below.
-
-Line fit    | R<sup>2</sup> | slope | intercept
-------------|---------------|-------|----------
-All           | 0.457     | 0.105 | 0.920     |
-size = 2      | 0.232     | 0.078 | 1.292     |
-size = 2,3,4  | 0.438     | 0.105 | 0.920     |
-F smokers     | 0.266     | 0.068 | 1.701     |
-M smokers     | 0.232     | 0.073 | 1.425     |
-F non-smokers | 0.686     | 0.128 | 0.452     |
-M non-smokers | 0.670     | 0.140 | 0.348     |
-day = Thur    | 0.660     | 0.128 | 0.512     |
-day = Fri     | 0.597     | 0.095 | 1.109     |
-day = Sat     | 0.495     | 0.121 | 0.519     |
-day = Sun     | 0.251     | 0.070 | 1.753     |
-
-What can we conclude from this? If higher R<sup>2</sup> indicates better a fit, then the data is fitted well by a linear model for non-smokers (regardless of sex) and for day = Thursday; these subsets result in the largest R<sup>2</sup> values and also high slopes. Maybe considering data from non-smokers on Thursday would produce the most reliable predictions of tip given total bill?
-
-**Tip predictions:**
-
-We can use our linear regression parameters to predict the tip amount for any total bill, say a bill of $100. 
-- Using all of the data, we predict a tip of $11.42 for this total bill amount;
-- For male non-smokers only, we predict a tip of $14.32;
-- Considering data from Thursday alone, we predict a tip of $13.29;
-- In contrast, Sunday data predicts just $8.77 as a tip.
-
-As the average total bill in this restaurant is just less than $20 and the maximum is about $50, it's unlikely that anyone would ever spend $100 here in the first place!
-
-## 4. Relationships between variables <a name="section4"></a>
+#### The need for a shuttle bus
+
+@@@@ 사진10
+
+
+### 3.3 Shuttle Bus Timetable <a name="sec3p3"></a>
+#### 3.1.1 To Muju
+@@@@ 사진11
+
+Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
+------------|---------------|---------------|---------------|---------------|---------------|---------------|---------------|
+10 o'clock   |ㅤㅤㅤㅤ|🚌ㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|
+12 o'clock   |🚌🚌|🚌🚌|🚌|||||
+14 o'clock   |🚌🚌|🚌||||||
+16 o'clock   |🚌🚌🚌|🚌|||||🚌|
+18 o'clock   |🚌🚌🚌|🚌🚌|🚌|🚌|🚌|🚌|🚌🚌|
+20 o'clock   |🚌🚌||||||🚌|
+
+
+#### 3.1.2 To Return
+@@@@ 사진12
+
+Departure | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
+------------|---------------|---------------|---------------|---------------|---------------|---------------|---------------|
+12 o'clock   ||||||||
+14 o'clock   ||🚌||||||
+16 o'clock   |ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|ㅤㅤㅤㅤ|   
+18 o'clock   |🚌|🚌🚌||||||
+20 o'clock   |🚌🚌🚌|🚌🚌🚌|🚌|🚌|🚌|🚌|🚌🚌🚌|
+22 o'clock   |🚌🚌|🚌|||||🚌|
+
+
+### 3.4 Shuttle Bus Route <a name="sec3p4"></a>
+#### 3.4.1 Selection of the station
+#### 3.4.1.1 Select the city that visits Muju the most during the festival
+Daejeon Line | 
+------------| 
+Sejong City |
+Yuseong-gu, Daejeon |
+Seo-gu, Daejeon |
+Daedeok-gu, Daejeon | 
+Jung-gu, Daejeon | 
+Geumsan-gun, Chungnam |
+Yeongdong-gun, Chungbuk |
+
+Jeonbuk Line |
+---------------|
+Gunsan-si, Jeonbuk |
+Iksan-si, Jeonbuk |
+Wansan-gu, Jeonju-si, Jeonbuk |
+Deokjin-gu, Jeonju-si, Jeonbuk |
+Jinan-gun, Jeonbuk |
+Jangsu-gun, Jeonbuk |
+
+
+## 4. About Model <a name="section4"></a>
 We have investigated if the tip amount is related to the total bill, and we have explored a little how that relationship is different depending on the subsets of data used. We now want to analyse other relationships between the variables of the data set.   
 
-### 4.1 Visualize relationships between numerical variables with pairplot <a name="sec4p1"></a>
+### 4.1 Brief explanation <a name="sec4p1"></a>
 The Seaborn **pairplot** function plots pairwise relationships in a data set. It generates a grid of scatterplots of each numeric variable plotted against all the others, and a histogram of values when a variable is plotted against itself. The *hue* keyword can be used to differentiate between the different categorical variables on each subplot. Using pairplot on the tips data set suggests a possibility of a linear relationship between tip and total bill. Luckily, that is the relationship we were asked to investigate in the previous section. A variable could be used to separate categories if the histograms for different categories do not overlap too much. We don't see much evidence for that in the pairplot - unlike say in the iris data set - so I'll take it no further. Below is a Seaborn pairplot for this data set.
 
 ![Pairplot](images/Pairplot.png)
 
-### 4.2 Investigate relationships between tip amount and the other variables <a name="sec4p2"></a>
+### 4.2 Linear Programing <a name="sec4p2"></a>
 I next used the **pivot_table()** function to summarize the tip according to each of the other variables. I came across this function in Wes McKinney's data analysis book and in his "10 minutes to Pandas" video (both referenced below). So, instead of looking at the average tip for the entire data set, we can see what the average tip is for all combinations of the sex and smoker categorical variables, for example. The default aggregation function is **mean** and I also use **count** to measure sample sizes. **max()** and **min()** are used to find the biggest and smallest values returned from pivot_table.
 
 #### 4.2.1 Tip vs sex, smoker, and size
@@ -228,7 +235,7 @@ Rather than continuing on trying to find meaningful combinations of variables to
 - The lowest average tip is left by female smokers (and non-smokers) dining alone at dinner on Saturdays (mean = $1.00 , count = 1 each).
 - The largest group is male, non-smokers, dining with one other person at dinner on Sundays (mean = $2.59, count = 22). The average tip left by this group is very similar to the average tip for the whole data set, $2.99.
 
-### 4.3 Does the amount spent depend on party size? <a name="sec4p3"></a>
+### 4.3 Flow Chart <a name="sec4p3"></a>
 We will now look for any relationships between the tip or total bill amounts and the dining party size. Below is a plot of the total bill versus party size, with data clumped along the y axis at each party size integer value. We first calculate the correlation matrix and resulting R<sup>2</sup> for total bill and party size;  R<sup>2</sup> = 0.358 so there is a weak linear relationship there. The total bill does increase as party size increases, as you would expect. 
 
 ![TotalBill_size](images/TotalBill_Size.png)
@@ -255,19 +262,11 @@ Summary of findings:
 - In conclusion, larger parties spend more money in total, but each person in the party spends less than if they were part of a smaller group.
 - Alternatively, the reduction in bill and tip per person could be happening because these larger parties include children, and children's meals are usually less expensive than adult meals.
 
-### 4.4 Classification <a name="sec4p4"></a>
-The last thing we will do is see if we can use any of the numerical variables to predict some of the categorical ones. This is called classification, as we are attempting to predict the value of a discrete categorical variable like sex, smoker, day or time for this particular data set. The categorical variables correspond to classes; we wish to predict, for example, the value of the time class - is it lunch or dinner? For this part of the notebook we use scikit-learn, a machine learning package for Python. 
 
-### K-nearest neighbours (knn) classification
-The algorithm we use is called k-nearest neighbours (knn). As the scikit-learn documentation states, "Classification is computed from a simple majority vote of the nearest neighbours of each point: a query point is assigned the data class which has the most representatives within the nearest neighbours of the point." It is an example of supervised learning because we train the classifier with data where the outputs that correspond to certain inputs are already known. The training data is a random  selection of observations from the data set. The testing data consists of the remaining observations. Performance of the classifier is quantified by measuring how many of the outputs in the testing data it predicts correctly. In the notebook we use the numerical variables tip, total bill, and size to make predictions of time - lunch or dinner. The classification is performed using the full data set and also again for a subset of the data set that includes only non-smokers.
-- The knn classifier for 5 nearest neighbours has a 69% success rate at predicting time over 100 runs. It's much better than just guessing.
-- Considering only non-smokers reduces the performance slightly, to 65%.
-- The actual numbers change a little each time the notebook is run, but the full data set has always performed better than the subset.
 
-## 5. Work done by other people on the Tips data set <a name="section5"></a>
-The tips data set is often used to illustrate the capabilities of Seaborn, so it appears a lot in the documentation for that package. These and some other examples are listed in the references below. It was actually difficult to find something new to do with this data set, but I haven't come across an analysis like I did in section 4.3, where I looked at tip and total bill per person. 
 
-I found an anonymous report from Iowa State University on the tips data state which is referenced below. It seems to be a report for a statistics class but with a business bias. There is no code in the report. Indeed, I don't know what application was used to perform the analysis, but I'm guessing a pure statistics package as there is mention of t-values and p-values without explanation of what they are. In that analysis, the tip rate (or fractional tip) is fitted against sex, smoker, time, size and day (but not Sunday for some reason). They conclude that size is the most important predictor of tip rate, followed by Saturday data. They then fit tip rate against size alone; and conclude that the tip rate drops by about 1% for each additional diner. 
+
+
 
 ## 6. Conclusion <a name="conclusion"></a>
 The main findings of this analysis are:

From fa20e65913273008263951ff55e159832a4918ef Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 02:27:50 +0900
Subject: [PATCH 04/10] docs #16: update README.md

---
 README.md | 175 +++++++++++++++++++++++++++++++++++++-----------------
 1 file changed, 122 insertions(+), 53 deletions(-)

diff --git a/README.md b/README.md
index 6b84967..7622649 100644
--- a/README.md
+++ b/README.md
@@ -19,16 +19,11 @@ https://github.com/NongShiN/2024_bigcontest_muju_festival_shuttle_bus
     3. [Shuttle Bus Timetable](#sec3p3)
     4. [Shuttle Bus Route](#sec3p4)
        
-4. [About Model](#section4)
-    1. [Brief explanation](#sec4p1)
-    2. [Linear Programing](#sec4p2)
-    3. [Flow Chart](#sec4p3)
-    
-5. [Conclusion](#conclusion)
-   1. [Summary](#sec5p1)
-   2. [Expectation Effectiveness](#sec5p2)
+4. [Conclusion](#section4)
+    1. [Summary](#sec4p1)
+    2. [Expectation Effectiveness](#sec4p2)
 
-6. [References](#references)
+5. [References](#references)
 
 ## 1. Introduction <a name="introduction"></a>
 - The competition is a 2024 big contest data analysis field, and it is a competition that selects traditional markets or festivals as targets for analysis with data related to population movement provided by SKT.
@@ -174,36 +169,129 @@ Departure | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 
 ### 3.4 Shuttle Bus Route <a name="sec3p4"></a>
 #### 3.4.1 Selection of the station
-#### 3.4.1.1 Select the city that visits Muju the most during the festival
-Daejeon Line | 
-------------| 
-Sejong City |
-Yuseong-gu, Daejeon |
-Seo-gu, Daejeon |
-Daedeok-gu, Daejeon | 
-Jung-gu, Daejeon | 
-Geumsan-gun, Chungnam |
-Yeongdong-gun, Chungbuk |
-
-Jeonbuk Line |
----------------|
-Gunsan-si, Jeonbuk |
-Iksan-si, Jeonbuk |
-Wansan-gu, Jeonju-si, Jeonbuk |
-Deokjin-gu, Jeonju-si, Jeonbuk |
-Jinan-gun, Jeonbuk |
-Jangsu-gun, Jeonbuk |
-
-
-## 4. About Model <a name="section4"></a>
+1. Select the city that visits Muju the most during the festival.
+```python
+map_filtered = pd.DataFrame()
+for region in festival_df_grouped_sido['시도명']:
+    # Calculate the sum of 'od_cnts' by 'Region' and sort by 'od_cnts'
+    region_filtered = festival_df_grouped_2[festival_df_grouped_2['시도명'] == region]
+    # Sejong city doesn't have 'City' name
+    if region == '세종특별자시치':
+        region_grouped = region_filtered.groupby('시도명')['od_cnts'].sum().reset_index()
+    else:
+        region_grouped = region_filtered.groupby('시군구명')['od_cnts'].sum().reset_index()
+    region_grouped['시도명'] = region
+    # Sort in descending order based on 'od_cnts'
+    region_grouped = region_grouped.sort_values(by='od_cnts', ascending=False)
+
+    map_filtered = pd.concat([map_filtered, region_grouped[region_grouped['od_cnts'] >= 500]])
+
+```
+map_filtered:
+City | # of Visitors
+------------| ------------| 
+Deokjin-gu, Jeonju-si, Jeonbuk | ㅤㅤ2834
+Seo-gu, Daejeon | ㅤㅤ1813
+Wansan-gu, Jeonju-si, Jeonbuk | ㅤㅤ1561
+Yuseong-gu, Daejeon | ㅤㅤ1379
+ㅤㅤㅤㅤㅤㅤ... | ㅤㅤ...
+Iksan-si, Jeonbuk | ㅤㅤ772
+Jinan-gun, Jeonbuk | ㅤㅤ704
+ㅤ
+   2. Give weight considering the number of visitors to the festival in the city.
+```json
+    "daejun": {
+        "nodes": [
+            "세종특별자치시",
+            "대전광역시 유성구",
+            "대전광역시 서구",
+            "대전광역시 대덕구",
+            "대전광역시 중구",
+            "충청남도 금산군",
+            "충청북도 영동군",
+            "전라북도 무주군"
+        ],
+        "weights": [1, 2, 2, 1, 1, 1, 2, 0]
+    },
+    "jeonbuk": {
+        "nodes": [
+            "전라북도 군산시",
+            "전라북도 익산시",
+            "전라북도 전주시 완산구",
+            "전라북도 전주시 덕진구",
+            "전라북도 진안군",
+            "전라북도 장수군",
+            "전라북도 무주군"
+        ],
+        "weights": [1, 1, 2, 2, 1, 2, 0]
+    }
+```
+ㅤ
+   3. Give weight considering the "percentage of age groups (20, 50, 60s)" that are difficult to travel long distances.
+```python
+def get_visitors_num(lst):
+    address = load_address()
+    sum_od_cnts_all, sum_od_cnts_age = number_of_visitors_to_Muju_by_region()
+    visitors_num = []
+    visitors_num_256 = []
+
+    for name in lst:
+        # Get the list of administrative district codes for each specified region
+        codes = address[address['시도 시군구'] == name]['행정동코드'].unique().tolist()
+        
+        # Initialize counters for total visitors and visitors in age groups 20s, 50s, and 60s
+        cnt_all = 0
+        cnt_256 = 0
+        
+        # Sum up visitors for each administrative district code in the region
+        for code in codes:
+            tmp_for_all = sum_od_cnts_all[sum_od_cnts_all['origin_hdong_cd'] == code]
+            if not tmp_for_all.empty:
+                cnt_all += tmp_for_all['od_cnts'].iloc[0]  # Total visitors from the administrative code
+            tmp_for_256 = sum_od_cnts_age[sum_od_cnts_age['origin_hdong_cd'] == code]
+            if not tmp_for_256.empty:
+                cnt_256 += tmp_for_256['od_cnts'].iloc[0]  # Total visitors from the specific age groups
+
+        visitors_num.append(int(cnt_all))
+        visitors_num_256.append(int(cnt_256))
+    
+    return visitors_num, visitors_num_256
+```
+
+```python
+def number_of_visitors_to_Muju_by_region():
+    df_od = load_od()
+    
+    # Number of visitors to Muju Festival by region
+    df_od_group = df_od.groupby(['origin_hdong_cd', 'date', 'age'])['od_cnts'].sum().reset_index()
+    df_od_all = df_od_group.groupby(['origin_hdong_cd', 'date'])['od_cnts'].sum().reset_index()  # Total visitors per day
+    sum_od_cnts_all = round(df_od_all.groupby(['origin_hdong_cd'])['od_cnts'].sum().reset_index(), 0)  # Total visitors from each region
+
+    # Number of Muju Festival visitors by region for age groups 20s, 50s, and 60s
+    df_od_age = df_od_group[df_od_group['age'].isin([2,5,6])]
+    sum_od_cnts_age = round(df_od_age.groupby('origin_hdong_cd')['od_cnts'].sum().reset_index(), 0)
+
+    return sum_od_cnts_all, sum_od_cnts_age
+```
+#### 3.4.2 Linear Programing
+@@@@@ 사진 13
+
+#### 3.4.3 Flow Chart
+@@@@ 사진 14
+#### 3.4.4 Recommended Route
+#### 3.4.4.1 Daejeon Line
+#### 3.4.4.2 Jeonbuk Line
+
+
+## 4. Conclusion <a name="section4"></a>
 We have investigated if the tip amount is related to the total bill, and we have explored a little how that relationship is different depending on the subsets of data used. We now want to analyse other relationships between the variables of the data set.   
 
-### 4.1 Brief explanation <a name="sec4p1"></a>
+### 4.1 Summary <a name="sec4p1"></a>
 The Seaborn **pairplot** function plots pairwise relationships in a data set. It generates a grid of scatterplots of each numeric variable plotted against all the others, and a histogram of values when a variable is plotted against itself. The *hue* keyword can be used to differentiate between the different categorical variables on each subplot. Using pairplot on the tips data set suggests a possibility of a linear relationship between tip and total bill. Luckily, that is the relationship we were asked to investigate in the previous section. A variable could be used to separate categories if the histograms for different categories do not overlap too much. We don't see much evidence for that in the pairplot - unlike say in the iris data set - so I'll take it no further. Below is a Seaborn pairplot for this data set.
 
 ![Pairplot](images/Pairplot.png)
 
-### 4.2 Linear Programing <a name="sec4p2"></a>
+### 4.2 Expectation Effectiveness <a name="sec4p2"></a>
 I next used the **pivot_table()** function to summarize the tip according to each of the other variables. I came across this function in Wes McKinney's data analysis book and in his "10 minutes to Pandas" video (both referenced below). So, instead of looking at the average tip for the entire data set, we can see what the average tip is for all combinations of the sex and smoker categorical variables, for example. The default aggregation function is **mean** and I also use **count** to measure sample sizes. **max()** and **min()** are used to find the biggest and smallest values returned from pivot_table.
 
 #### 4.2.1 Tip vs sex, smoker, and size
@@ -264,26 +352,7 @@ Summary of findings:
 
 
 
-
-
-
-
-## 6. Conclusion <a name="conclusion"></a>
-The main findings of this analysis are:
-1. Average tip = $2.99, minimum = $1, maximum = $10.
-2. Average total bill = $19.79, minimum = $3.07, maximum = $50.81.
-3. Average fractional tip = 0.16, minimum = 0.04, maximum = 0.71.
-4. 151 of the 244 observations concern non-smokers.
-5. 157 of the 244 observations concern males.
-6. 87 of the 244 observations relate to Saturday.
-7. 176 of the 244 observations relate to dinner.
-8. 156 of the 244 observations concern party size of two.
-9. The largest group represented in the data set is: male, non-smokers, dining with one other person at dinner on Sundays. There are 22 of them. The average tip left by this group is $2.59, very similar to the average tip for the whole data set, $2.99. 
-10. There is a linear relationship between tip and total bill: tip = 0.11 (total_bill) + 0.92
-11. Larger parties spend more money in total, but each person in the party spends less than if they were part of a smaller group. The same applies to the tip amount. The tip per person versus size is fit really well by a linear model. 
-12. A k-nearest neighbours classifier was used to predict time from tip, total bill and size inputs. The classifier predicted the time (lunch or dinner) correctly about 69% of the time. Considering only data from non-smokers reduced the classifier performance by a few percent.
-
-## 7. References <a name="references"></a>
+## 5. References <a name="references"></a>
 
 **General:**
 

From be3db63b1b6ff44166ce0b4ae31e7aefb1e2530b Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 03:05:34 +0900
Subject: [PATCH 05/10] docs #16: update README.md

---
 README.md | 148 ++++++++++++++++++++----------------------------------
 1 file changed, 55 insertions(+), 93 deletions(-)

diff --git a/README.md b/README.md
index 7622649..20f72be 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-# @@@ 1. 대문사진 
+@@@ 1. 대문사진 
 
 ### 2022 Bigcontest data analysis field
 
@@ -10,22 +10,22 @@ https://github.com/NongShiN/2024_bigcontest_muju_festival_shuttle_bus
 
 2. [Description of the data set](#section2)
     1. [Initial steps](#sec2p1)
-    2. [Visitor Analysis](#sec2p2)
-    3. [Movement Analysis](#sec2p3)
+    2. [Visitor analysis](#sec2p2)
+    3. [Movement analysis](#sec2p3)
 
 3. [Problem definitions and solutions](#section3)
-    1. [Hypothesis Setting and Cause Analysis](#sec3p1)
-    2. [Proposition of Shuttle Buses](#sec3p2)
-    3. [Shuttle Bus Timetable](#sec3p3)
-    4. [Shuttle Bus Route](#sec3p4)
+    1. [Hypothesis setting and cause analysis](#sec3p1)
+    2. [Proposition of shuttle buses](#sec3p2)
+    3. [Shuttle bus timetable](#sec3p3)
+    4. [Shuttle bus route](#sec3p4)
        
 4. [Conclusion](#section4)
     1. [Summary](#sec4p1)
-    2. [Expectation Effectiveness](#sec4p2)
+    2. [Expectation effectiveness](#sec4p2)
 
 5. [References](#references)
 
-## 1. Introduction <a name="introduction"></a>
+# 1. Introduction <a name="introduction"></a>
 - The competition is a 2024 big contest data analysis field, and it is a competition that selects traditional markets or festivals as targets for analysis with data related to population movement provided by SKT.
 
 - We chose **Muju Firefly Festival** and came up with measures to revitalize the festival.
@@ -34,8 +34,8 @@ https://github.com/NongShiN/2024_bigcontest_muju_festival_shuttle_bus
 
 - We analyzed the current status of the festival and presented problem definitions and solutions accordingly.
 
-##  2. Description of the data set <a name="section2"></a>
-### 2.1 Initial steps <a name="sec2p1"></a>
+#  2. Description of the dataset <a name="section2"></a>
+## 2.1 Initial steps <a name="sec2p1"></a>
 
 This is "od_yyyymmdd_1.csv" data (hereinafter, od data), which is OD data between administrative periods from 2023.9.1 to 2023.10.15. 
 
@@ -46,8 +46,8 @@ The other is "stay_yyyymmdd_1.csv" data (hereinafter, stay data), which is the n
 @@@ 3. stay_df 사진
 
 
-### 2.2 Visitor Analysis <a name="sec2p2"></a>
-#### This is the result of analyzing the number of people who visited Muju during the festival by age group.
+## 2.2 Visitor analysis <a name="sec2p2"></a>
+### This is the result of analyzing the number of people who visited Muju during the festival by age group.
 
 @@@ 4. 연령별 방문인원 분포 사진
 
@@ -56,7 +56,7 @@ From this summary we can say that:
 2. From this, it can be inferred that a large number of family visitors have visited, accounting for a total of 78%.
 3. Among the remaining age groups, the proportion of people in their 20s is the highest, and the proportion of the remaining age groups (10s, 50s, 60s, 70s, and 80s) is less than 5%.
 
-#### This is the result of analyzing the number of people who stayed Muju during the festival by age group.
+### This is the result of analyzing the number of people who stayed Muju during the festival by age group.
 
 @@@ 5. 연령별 거주인원 분포사진
 
@@ -64,7 +64,7 @@ From this summary we can say that:
 1. The percentage of staying people 40s is the highest, followed by those in their 30s, under 10s and 30s.
 2. In the od data, few elderly people were observed, but the stay data clearly shows the ratio of those in their 50s to those in their 60s.
 
-#### This is the result of distribution of festival visitors' residence.
+### This is the result of distribution of festival visitors' residence.
 
 @@@ 6. 방문객 고향 사진
 
@@ -73,8 +73,8 @@ From this summary we can say that:
 2. The average proportion of outsiders in Korea's festivals is 50%. It can be seen that the proportion of outsiders in the Muju Firefly Festival is 88% very high.
 
 
-### 2.3 Movement Analysis <a name="sec2p3"></a>
-#### This is the result of the distribution of travel distance to Muju by age group.
+## 2.3 Movement analysis <a name="sec2p3"></a>
+### 2.3.1 Result of the distribution of travel distance to Muju by age group.
 
 @@@ 7. 연령대별 이동거리 차이 사진
 
@@ -82,24 +82,24 @@ From this summary we can say that:
 1. Those under 10s and 30s and 40s visit from various distances, ranging from close to far away.
 2. 10s, 20s, 50s, and 60s usually visit at close range.
 
-#### This is the distribution of transportation used by festival visitors.
+### 2.3.2 The distribution of transportation used by festival visitors.
 
 @@@ 8. 방문객 이용 교통수단 사진
 
 From this summary we can say that:
 - With 39019 cases of car use, most visitors visited the festival by car.
 
-##  3. Problem definitions and solutions <a name="section3"></a>
-### 3.1 Hypothesis Setting and Cause Analysis <a name="sec3p1"></a>
-#### 3.1.1 Hypothesis Setting
+#  3. Problem definitions and solutions <a name="section3"></a>
+## 3.1 Hypothesis setting and cause analysis <a name="sec3p1"></a>
+### 3.1.1 Hypothesis setting
 
 1.  The means of off-vehicle transportation are poor.
 2.  There are restrictions on participation according to accessibility by age group.
 
-#### 3.1.2 Cause Analysis
+### 3.1.2 Cause analysis
 #### 3.1.2.1 The results of transportation and time required to travel from major cities to Muju.
 
-Departure City | Travel Route | Time Required |
+Departure City | Travel Route | Time |
 ------------|---------------|-------|
 Seoul           | Seoul Station - KTX - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
 Jeonju     | Jeonju Express Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
@@ -134,14 +134,14 @@ From this survey we can say that:
 - Many participants can see that they are uncomfortable with the transportation of the festival.
 
 
-### 3.2 Proposition of Shuttle Buse <a name="sec3p2"></a>
-#### The need for a shuttle bus
+## 3.2 Proposition of shuttle bus <a name="sec3p2"></a>
+### The need for a shuttle bus
 
 @@@@ 사진10
 
 
-### 3.3 Shuttle Bus Timetable <a name="sec3p3"></a>
-#### 3.1.1 To Muju
+## 3.3 Shuttle bus rimetable <a name="sec3p3"></a>
+### 3.1.1 To Muju
 @@@@ 사진11
 
 Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
@@ -154,7 +154,7 @@ Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 20 o'clock   |🚌🚌||||||🚌|
 
 
-#### 3.1.2 To Return
+### 3.1.2 To Return
 @@@@ 사진12
 
 Departure | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
@@ -167,8 +167,8 @@ Departure | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 22 o'clock   |🚌🚌|🚌|||||🚌|
 
 
-### 3.4 Shuttle Bus Route <a name="sec3p4"></a>
-#### 3.4.1 Selection of the station
+## 3.4 Shuttle bus route <a name="sec3p4"></a>
+### 3.4.1 Selection of the station
 1. Select the city that visits Muju the most during the festival.
 ```python
 map_filtered = pd.DataFrame()
@@ -197,7 +197,8 @@ Yuseong-gu, Daejeon | ㅤㅤ1379
 ㅤㅤㅤㅤㅤㅤ... | ㅤㅤ...
 Iksan-si, Jeonbuk | ㅤㅤ772
 Jinan-gun, Jeonbuk | ㅤㅤ704
-ㅤ
+
+
    2. Give weight considering the number of visitors to the festival in the city.
 ```json
     "daejun": {
@@ -273,82 +274,43 @@ def number_of_visitors_to_Muju_by_region():
 
     return sum_od_cnts_all, sum_od_cnts_age
 ```
-#### 3.4.2 Linear Programing
+### 3.4.2 Linear Programing
 @@@@@ 사진 13
 
-#### 3.4.3 Flow Chart
+### 3.4.3 Flow Chart
 @@@@ 사진 14
-#### 3.4.4 Recommended Route
-#### 3.4.4.1 Daejeon Line
-#### 3.4.4.2 Jeonbuk Line
-
-
-## 4. Conclusion <a name="section4"></a>
-We have investigated if the tip amount is related to the total bill, and we have explored a little how that relationship is different depending on the subsets of data used. We now want to analyse other relationships between the variables of the data set.   
-
-### 4.1 Summary <a name="sec4p1"></a>
-The Seaborn **pairplot** function plots pairwise relationships in a data set. It generates a grid of scatterplots of each numeric variable plotted against all the others, and a histogram of values when a variable is plotted against itself. The *hue* keyword can be used to differentiate between the different categorical variables on each subplot. Using pairplot on the tips data set suggests a possibility of a linear relationship between tip and total bill. Luckily, that is the relationship we were asked to investigate in the previous section. A variable could be used to separate categories if the histograms for different categories do not overlap too much. We don't see much evidence for that in the pairplot - unlike say in the iris data set - so I'll take it no further. Below is a Seaborn pairplot for this data set.
-
-![Pairplot](images/Pairplot.png)
-
-### 4.2 Expectation Effectiveness <a name="sec4p2"></a>
-I next used the **pivot_table()** function to summarize the tip according to each of the other variables. I came across this function in Wes McKinney's data analysis book and in his "10 minutes to Pandas" video (both referenced below). So, instead of looking at the average tip for the entire data set, we can see what the average tip is for all combinations of the sex and smoker categorical variables, for example. The default aggregation function is **mean** and I also use **count** to measure sample sizes. **max()** and **min()** are used to find the biggest and smallest values returned from pivot_table.
 
-#### 4.2.1 Tip vs sex, smoker, and size
-
-The output of pivot_table for average tip summarized against these three variables looks like:
-
-![PT_SexSmokerSize](images/PT_SexSmokerSize.JPG)
-
-Using the count function with pivot_table returns:
-
-![PTcount_SexSmokerSize](images/PTcount_SexSmokerSize.JPG)
-
-From this part of the notebook we can say that:
-- The largest average tip is left by male non-smokers in a party of 6 (mean = $5.85 , count = 2).
-- The lowest average tip is left by female smokers dining alone (mean = $1.00 , count = 1).
-- The largest group is male non-smokers in a party of 2 (mean = $2.55, count = 57).
-
-#### 4.2.2 Tip vs sex, smoker, and day
-
-We then used a pivot_table to calculate averages of tip over sex, smoker and day variables. I won't include the table itself here, just the main results we are interested in, namely:
-- The largest average tip is left by male smokers on Sundays (mean = $3.52 , count = 15).
-- The lowest average tip is left by female non-smokers on Thursdays (mean =$2.46, count = 25).
-- The largest group is male non-smokers on Sundays (mean = $3.12, count = 43).
-
-#### 4.2.3 Tip vs sex, smoker, day, time and size
+### 3.4.4 Recommended Route
+#### 3.4.4.1 Daejeon Line
 
-Rather than continuing on trying to find meaningful combinations of variables to use, I finally realised that I could make a pivot table summarizing tip averages over five other variables. The table is huge and not easy to read, but the main findings are:
-- The highest average tip comes from male non-smokers at lunch on Thursday in a party of six (mean = $6.70 , count = 1).
-- The lowest average tip is left by female smokers (and non-smokers) dining alone at dinner on Saturdays (mean = $1.00 , count = 1 each).
-- The largest group is male, non-smokers, dining with one other person at dinner on Sundays (mean = $2.59, count = 22). The average tip left by this group is very similar to the average tip for the whole data set, $2.99.
+@@@ 사진 15
 
-### 4.3 Flow Chart <a name="sec4p3"></a>
-We will now look for any relationships between the tip or total bill amounts and the dining party size. Below is a plot of the total bill versus party size, with data clumped along the y axis at each party size integer value. We first calculate the correlation matrix and resulting R<sup>2</sup> for total bill and party size;  R<sup>2</sup> = 0.358 so there is a weak linear relationship there. The total bill does increase as party size increases, as you would expect. 
+Rank | Travel Route | Distance | Time |
+------------|---------------|---------------|---------------|
+ㅤ1 | Seo-gu, Daejeon → Daedeok-gu, Daejeon → Jung-gu, Daejeon → Muju-gun, Jeonbuk | 61.58km | 1h 37m |
+ㅤ2 | Seo-gu, Daejeon → Yuseong-gu, Daejeon → Jung-gu, Daejeon → Muju-gun, Jeonbuk | 68.50km | 1h 45m |
+ㅤ3 | Seo-gu, Daejeon → Yuseong-gun, Daejeon → Geumsan-gun, Chungnam → Muju-gun, Jeonbuk | 83.24km | 1h 39m |
+ㅤ4 | Sejong City → Daedeok-gu, Daejeon → Jung-gu, Daejeon → Muju-gun, Jeonbuk | 83.64km | 2h 2m |
 
-![TotalBill_size](images/TotalBill_Size.png)
+#### 3.4.4.2 Jeonbuk Line
 
-The tip also increases as party size increases but I did not perform any regression on that data. Instead I decided to look at the total bill or tip *per person*. For this, two new columns are added to the data set:
-- tpp or tip per person = tip / size
-- bpp or bill per person = total bill / size
+@@@ 사진 16
 
-Pandas **groupby()** is then used to calculate the average tip or bill per person for each party size. Be aware that we don't have a lot of data for party sizes of 1, 5, or 6. Simple linear regression (using numpy.polyfit) was then performed on these average values. I wanted to see if the average bill (or tip) per person was linearly related to party size. The resulting plots and fit parameters are:
+Rank | Travel Route | Distance | Time |
+------------|---------------|---------------|---------------|
+ㅤ1 | Gunsan-si, Jeonbuk → Iksan-si, Jeonbuk → Jinan-gun, Jeonbuk → Muju-gun, Jeonbuk | 133.84km | 2h 21m |
+ㅤ2 | Gunsan-si, Jeonbuk → Deokjin-gu, Jeonju-si, Jeonbuk → Jinan-gun, Jeonbuk → Muju-gun, Jeonbuk | 120.89km | 2h 34m |
+ㅤ3 | Gunsan-si, Jeonbuk → Iksan-si, Jeonbuk → Wansan-gu, Jeonju-si, Jeonbuk → Muju-gun, Jeonbuk | 131.27km | 2h 50m |
+ㅤ4 | Gunsan-si, Jeonbuk → Iksan-si, Jeonbuk → Jangsu-gun, Jeonbuk → Muju-gun, Jeonbuk | 165.31km | 2h 37m |
 
-![OLSbpp](images/LSQbpp.png)
 
-Slope = -0.412130, intercept = 8.475404, R<sup>2</sup> = 0.653
+# 4. Conclusion <a name="section4"></a>
+  
 
-![OLStpp](images/LSQtpp.png)
+## 4.1 Summary <a name="sec4p1"></a>
 
-Slope = -0.125348, intercept = 1.533718, R<sup>2</sup> = 0.933
 
-Summary of findings:
-- The average bill per person decreases as party size increases.
-- There is a good linear relationship (high R<sup>2</sup>) between the average bill per person and party size.
-- The average tip per person also decreases as party size increases.
-- There is a very strong linear relationship (high R<sup>2</sup>) between the average tip per person and party size.
-- In conclusion, larger parties spend more money in total, but each person in the party spends less than if they were part of a smaller group.
-- Alternatively, the reduction in bill and tip per person could be happening because these larger parties include children, and children's meals are usually less expensive than adult meals.
+### 4.2 Expectation Effectiveness <a name="sec4p2"></a>
 
 
 

From fe610147b73deee3b48cc82d3ea30e49684a8e25 Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 21:05:34 +0900
Subject: [PATCH 06/10] docs #16: update README.md

---
 README.md | 210 +++++++++++++++++-------------------------------------
 1 file changed, 66 insertions(+), 144 deletions(-)

diff --git a/README.md b/README.md
index 20f72be..daadb28 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,4 @@
-@@@ 1. 대문사진 
-
+![1](https://github.com/user-attachments/assets/3f4fed98-7962-4996-93e7-d8b9eb0cc361)
 ### 2022 Bigcontest data analysis field
 
 Git-hub repository at:
@@ -37,36 +36,36 @@ https://github.com/NongShiN/2024_bigcontest_muju_festival_shuttle_bus
 #  2. Description of the dataset <a name="section2"></a>
 ## 2.1 Initial steps <a name="sec2p1"></a>
 
-This is "od_yyyymmdd_1.csv" data (hereinafter, od data), which is OD data between administrative periods from 2023.9.1 to 2023.10.15. 
+- This is "od_yyyymmdd_1.csv" data (hereinafter, od data), which is OD data between administrative periods from 2023.9.1 to 2023.10.15. 
 
-@@@ 2. od_df 사진 
+![2](https://github.com/user-attachments/assets/e44e5ded-101d-41b3-854a-b168218f0764)
 
-The other is "stay_yyyymmdd_1.csv" data (hereinafter, stay data), which is the national administrative unit residence population data from 2023.09.01 to 2023.10.15.
+- The other is "stay_yyyymmdd_1.csv" data (hereinafter, stay data), which is the national administrative unit residence population data from 2023.09.01 to 2023.10.15.
 
-@@@ 3. stay_df 사진
+![3](https://github.com/user-attachments/assets/252f80c4-fe6f-4f4d-b855-fb3c05861e81)
 
 
 ## 2.2 Visitor analysis <a name="sec2p2"></a>
-### This is the result of analyzing the number of people who visited Muju during the festival by age group.
+### 2.2.1 Result of analyzing the number of people who visited Muju during the festival by age group.
 
-@@@ 4. 연령별 방문인원 분포 사진
+![4](https://github.com/user-attachments/assets/e52569e5-3986-4c8a-b5b4-4abb33bd4002)
 
 From this summary we can say that:
 1. The percentage of visitors under 10s is the highest, followed by those in their 40s and 30s.
 2. From this, it can be inferred that a large number of family visitors have visited, accounting for a total of 78%.
 3. Among the remaining age groups, the proportion of people in their 20s is the highest, and the proportion of the remaining age groups (10s, 50s, 60s, 70s, and 80s) is less than 5%.
 
-### This is the result of analyzing the number of people who stayed Muju during the festival by age group.
+### 2.2.2 Result of analyzing the number of people who stayed Muju during the festival by age group.
 
-@@@ 5. 연령별 거주인원 분포사진
+![5](https://github.com/user-attachments/assets/8db12ac8-a353-476e-bc65-42db7a8ffff6)
 
 From this summary we can say that:
 1. The percentage of staying people 40s is the highest, followed by those in their 30s, under 10s and 30s.
 2. In the od data, few elderly people were observed, but the stay data clearly shows the ratio of those in their 50s to those in their 60s.
 
-### This is the result of distribution of festival visitors' residence.
+### 2.2.3 Result of distribution of festival visitors' residence.
 
-@@@ 6. 방문객 고향 사진
+![6](https://github.com/user-attachments/assets/90005650-4b72-47b4-8ba4-67fb707b6548)
 
 From this summary we can say that:
 1. It can be seen that many visitors to the festival came from Jeonbuk and Chungnam/Daejeon.
@@ -76,7 +75,7 @@ From this summary we can say that:
 ## 2.3 Movement analysis <a name="sec2p3"></a>
 ### 2.3.1 Result of the distribution of travel distance to Muju by age group.
 
-@@@ 7. 연령대별 이동거리 차이 사진
+![7](https://github.com/user-attachments/assets/2462d208-d46b-48e8-b3d2-49a5879d7f4a)
 
 From this summary we can say that:
 1. Those under 10s and 30s and 40s visit from various distances, ranging from close to far away.
@@ -84,7 +83,7 @@ From this summary we can say that:
 
 ### 2.3.2 The distribution of transportation used by festival visitors.
 
-@@@ 8. 방문객 이용 교통수단 사진
+![8](https://github.com/user-attachments/assets/95ff40ef-124d-4605-b6a5-0919d49191b5)
 
 From this summary we can say that:
 - With 39019 cases of car use, most visitors visited the festival by car.
@@ -99,20 +98,20 @@ From this summary we can say that:
 ### 3.1.2 Cause analysis
 #### 3.1.2.1 The results of transportation and time required to travel from major cities to Muju.
 
-Departure City | Travel Route | Time |
+Departure | Travel Route | Time |
 ------------|---------------|-------|
-Seoul           | Seoul Station - KTX - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
-Jeonju     | Jeonju Express Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 2h 30m |
-Daegu  | Daegu Station - Mugunghwa Train - Yeongdong Station - City Bus - Muju Public Bus Terminal | 2h 40m |
-Busan     | Busan Station - SRT - Daejeon Station - City Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 3h |
-Gwangju     | Gwangju Bus Terminal - Express Bus - Daejeon Complex Terminal - Intercity Bus - Muju Public Bus Terminal | 3h 20m | 
+Seoul           | Seoul Station (KT) → Daejeon Station (City Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 2h 30m |
+Jeonju     | Jeonju Express Bus Terminal (Express Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 2h 30m |
+Daegu  | Daegu Station (Mugunghwa Train) → Yeongdong Station (City Bus) → Muju Bus Terminal | 2h 40m |
+Busan     | Busan Station (SRT) → Daejeon Station (City Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 3h |
+Gwangju     | Gwangju Bus Terminal (Express Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 3h 20m | 
 
 From this summary we can say that:
 - From other cities to Muju festival sites, the travel route is complicated and the travel time is too long.
 
 #### 3.1.2.2 This shows the contents of the festival by time and the last bus time from the festival site to each city
 
-@@@@ 사진9
+![9](https://github.com/user-attachments/assets/0e0b0fe5-369e-47ed-a524-4ccb00366553)
 
 From this summary we can say that:
 1. Bus services are limited to certain areas and time zones.
@@ -125,10 +124,10 @@ From this summary we can say that:
 #### 3.1.2.4 Improvements to the 27th Muju Firefly Festival (last year)
  Rank | Content |
 ------------|---------------|
-1           | Transportation |
-2     | The variety of festival food |
-3  | Good things to buy / Local specialties |
-4     | Event tour information |
+ㅤ1           | Transportation |
+ㅤ2     | The variety of festival food |
+ㅤ3  | Good things to buy / Local specialties |
+ㅤ4     | Event tour information |
 
 From this survey we can say that:
 - Many participants can see that they are uncomfortable with the transportation of the festival.
@@ -137,12 +136,13 @@ From this survey we can say that:
 ## 3.2 Proposition of shuttle bus <a name="sec3p2"></a>
 ### The need for a shuttle bus
 
-@@@@ 사진10
+![10](https://github.com/user-attachments/assets/65fe38cf-c70a-4f3d-b382-0b4d94a7d783)
 
 
 ## 3.3 Shuttle bus rimetable <a name="sec3p3"></a>
 ### 3.1.1 To Muju
-@@@@ 사진11
+
+![11](https://github.com/user-attachments/assets/61cc4717-acba-4088-8a25-ed9ec87164d0)
 
 Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 ------------|---------------|---------------|---------------|---------------|---------------|---------------|---------------|
@@ -155,7 +155,8 @@ Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 
 
 ### 3.1.2 To Return
-@@@@ 사진12
+
+![12](https://github.com/user-attachments/assets/2136989f-1ba8-4fb5-9e2d-aa73b94ad058)
 
 Departure | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 ------------|---------------|---------------|---------------|---------------|---------------|---------------|---------------|
@@ -275,15 +276,17 @@ def number_of_visitors_to_Muju_by_region():
     return sum_od_cnts_all, sum_od_cnts_age
 ```
 ### 3.4.2 Linear Programing
-@@@@@ 사진 13
+
+![13](https://github.com/user-attachments/assets/bc839c99-8357-4ad6-818d-61001801cd1e)
 
 ### 3.4.3 Flow Chart
-@@@@ 사진 14
+
+<img src="https://github.com/user-attachments/assets/bee931c6-d9a3-4301-88dd-f6c2de8b52f6" alt="14" width="500"/>
 
 ### 3.4.4 Recommended Route
 #### 3.4.4.1 Daejeon Line
 
-@@@ 사진 15
+<img src="https://github.com/user-attachments/assets/b6939615-35cf-4055-8d98-018824cc31a2" alt="15" width="500"/>
 
 Rank | Travel Route | Distance | Time |
 ------------|---------------|---------------|---------------|
@@ -292,9 +295,11 @@ Rank | Travel Route | Distance | Time |
 ㅤ3 | Seo-gu, Daejeon → Yuseong-gun, Daejeon → Geumsan-gun, Chungnam → Muju-gun, Jeonbuk | 83.24km | 1h 39m |
 ㅤ4 | Sejong City → Daedeok-gu, Daejeon → Jung-gu, Daejeon → Muju-gun, Jeonbuk | 83.64km | 2h 2m |
 
+---------------------------------------------
+
 #### 3.4.4.2 Jeonbuk Line
 
-@@@ 사진 16
+![16](https://github.com/user-attachments/assets/9c351b94-87bd-4f06-89de-a621f191e217)
 
 Rank | Travel Route | Distance | Time |
 ------------|---------------|---------------|---------------|
@@ -305,134 +310,51 @@ Rank | Travel Route | Distance | Time |
 
 
 # 4. Conclusion <a name="section4"></a>
-  
-
 ## 4.1 Summary <a name="sec4p1"></a>
+#### 4.1.1 Analyze Muju festival and define problem: Lack of access to transportation to festival sites
+- A high percentage of outsiders
 
+- Lack of public transport infrastructure
 
-### 4.2 Expectation Effectiveness <a name="sec4p2"></a>
-
-
-
-## 5. References <a name="references"></a>
-
-**General:**
-
-- [1]  Anaconda Distribution
-https://www.anaconda.com/
-
-- [2] Python Software Foundation
-https://www.python.org/
-
-- [3] Project Jupyter
-https://jupyter.org/
-
-- [4] Sharing Jupyter notebooks
-https://nbviewer.jupyter.org/
-
-- [5] seaborn: statistical data visualization
-https://seaborn.pydata.org/index.html#
-
-- [6] matplotlib: Python plotting library
-https://matplotlib.org/
-
-- [7] The Tips data set from Michael Waskom
-https://github.com/mwaskom/seaborn-data/blob/master/tips.csv
+- Targeting for various age groups
 
-- [8] Description of what is contained in the tips set
-https://www.kaggle.com/ranjeetjain3/seaborn-tips-data set
+- Lack of access to certain age groups
 
-- [9] scikit-learn: Machine Learning in Python
-https://scikit-learn.org/stable/index.html
+#### 4.1.2 Solution: Propose introduction of shuttle buses connecting major cities during the festival
+- Propose a timetable through analysis of visitor data by day and hour
 
-- [10] statsmodels: Statistics in Python
-https://www.statsmodels.org/stable/index.html
+- Propose a route through analysis of visitor data by region
 
-- [11] scipy.stats : Statistics with SciPy
-https://docs.scipy.org/doc/scipy/reference/tutorial/stats.html
-
-**Exploratory data analysis:**
-
-- [12] Exploratory Statistical Data Analysis with a Real data set using Pandas
-https://towardsdatascience.com/exploratory-statistical-data-analysis-with-a-real-data set-using-pandas-208007798b92
-
-- [13] How to investigate a data set with Python
-https://towardsdatascience.com/hitchhikers-guide-to-exploratory-data-analysis-6e8d896d3f7e
-
-- [14] Data analysis with Python
-https://medium.com/@onpillow/01-investigate-tmdb-movie-data set-python-data-analysis-project-part-1-data-wrangling-3d2b55ea7714
-
-- [15] Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython. 
-Wes McKinney. ISBN-13: 978-1491957660 ISBN-10: 1491957662
-
-- [16] Pandas In 10 Minutes || Wes McKinney
-https://www.youtube.com/watch?v=1MGCD8SQp3k
-
-- [17] Good description of quartiles on Seaborn plots
-https://towardsdatascience.com/analyze-the-data-through-data-visualization-using-seaborn-255e1cd3948e
-
-**Regression:**
-
-- [18] Ordinary Least Squares in statsmodels
-https://www.statsmodels.org/dev/examples/notebooks/generated/ols.html
-
-- [19] Generalized Linear Models in scikit-learn
-https://scikit-learn.org/stable/modules/linear_model.html#ordinary-least-squares
-
-- [20] How to run Linear regression in Python scikit-Learn
-https://bigdata-madesimple.com/how-to-run-linear-regression-in-python-scikit-learn/
-
-- [21] A beginner’s guide to Linear Regression in Python with Scikit-Learn
-https://towardsdatascience.com/a-beginners-guide-to-linear-regression-in-python-with-scikit-learn-83a8f7ae2b4f
-
-- [22] Regression Analysis: How Do I Interpret R-squared and Assess the Goodness-of-Fit?
-https://blog.minitab.com/blog/adventures-in-statistics-2/regression-analysis-how-do-i-interpret-r-squared-and-assess-the-goodness-of-fit
-
-- [23] Python and R Tips To Learn Data Science: Pearson and Spearman Correlation in Python
-https://cmdlinetips.com/2019/08/how-to-compute-pearson-and-spearman-correlation-in-python/
-
-**Classification:**
-
-- [24] K-nearest Neighbors (KNN) Classification Model
-https://www.ritchieng.com/machine-learning-k-nearest-neighbors-knn/
-
-- [25] Supervised and Unsupervised Machine Learning Algorithms
-https://machinelearningmastery.com/supervised-and-unsupervised-machine-learning-algorithms/
-
-- [26] Cross-Validation
-https://www.ritchieng.com/machine-learning-cross-validation/
+## 4.2 Expectation Effectiveness <a name="sec4p2"></a>
+#### 4.2.1 Improve transportation inconvenience
+- Improve access to festival sites.
+- Alleviate traffic congestion near the festival site.
+  
+#### 4.2.2 Increase participation rate
+- Reduced transportation costs.
+- Encourage the participation of various age groups by simplifying travel routes.
 
-**References directly relating to Tips:**
+#### 4.2.3 Environmental benefits
+- Reduce carbon emissions.
+- Realize the value of **'green'**, the slogan of Muju Festival
 
-- [27] Tips data set in PYTHON MACHINE LEARNING EXAMPLE – LINEAR REGRESSION
-https://devarea.com/python-machine-learning-example-linear-regression/#.XbbfgOj7Q2w
+# 5. References <a name="references"></a>
 
-- [28] Tips analysis using Seaborn: Visualizing statistical relationships
-https://seaborn.pydata.org/tutorial/relational.html#relational-tutorial
+**Paper:**
 
-- [29] Tips analysis using Seaborn: Plotting with categorical data
-https://seaborn.pydata.org/tutorial/categorical.html#categorical-tutorial
+- [1]  Changsoo Kim, Hyungbin Jang. (2014). A Study on the Relations between Vistor orientation and Consumer spending in the past 3 years Muju Firefly Festival, 18(1), 1-19.
 
-- [30] Tips analysis using Seaborn: Visualizing linear relationships
-https://seaborn.pydata.org/tutorial/regression.html#regression-tutorial
+    https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART001866295
 
-- [31] Tips analysis using Seaborn: Building structured multi-plot grids
-https://seaborn.pydata.org/tutorial/axis_grids.html#grid-tutorial
+- [2] Huiseok Seo, Junghyun Yoon. (2006). A Study on the Success Factors of Regional Festivals -focusing on the Andong maskdance festival, Hampyeong butterfly festival, and Iksan Seodong festival-, 20(4), 207-228.
 
-- [32] STAT 503 Case Study 1: Restaurant Tipping (Author unknown)
-https://dicook.public.iastate.edu/stat503/05/cs-tips2.pdf
+    https://www.krila.re.kr/download/thesis/563
 
-- [33] Interactive analytics and predictions on Restaurant tips
-https://medium.com/@valentinaalto/interactive-analytics-and-predictions-on-restaurant-tips-94f21f537de8
+- [3] Changwoo Jeon, Gunhak Lee. (2017). Optimal Routing of Free Shuttle Bus to Enhance the Travel Convenience for the Elderly: A Case of Gwanak-gu, Seoul, 6(2), 291-304.
 
-- [34] Seaborn again: Python Data Visualisation using Seaborn 
-https://grindsquare.co.za/python-data-visualisation-using-seaborn/
+     https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002252629
 
-- [35] Excerpt from the Python Data Science Handbook by Jake VanderPlas; Jupyter notebooks are available on GitHub.
-https://jakevdp.github.io/PythonDataScienceHandbook/04.14-visualization-with-seaborn.html
+- [4] Eunhak Lee, Seung-young Ko, Dongkyu Kim. (2021). Optimization of Direct Bus Route using Smart Card Data. 85th Conference of the Korea Transportation Association
 
-- [36] Interactive analytics and predictions on Restaurant tips
-https://datasciencechalktalk.com/2019/11/03/interactive-analytics-and-predictions-on-restaurant-tips/
+    https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE10675729
 
-- [37] atlassian.com: .gitignore
-https://www.atlassian.com/git/tutorials/saving-changes/gitignore#personal-git-ignore-rules

From 8ee002bfb176049f42eef92d6fc353fdbb0b351c Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 21:15:11 +0900
Subject: [PATCH 07/10] docs #16: update README.md

---
 README.md | 37 +++++++++++++++++++++----------------
 1 file changed, 21 insertions(+), 16 deletions(-)

diff --git a/README.md b/README.md
index daadb28..83ed6d4 100644
--- a/README.md
+++ b/README.md
@@ -38,17 +38,20 @@ https://github.com/NongShiN/2024_bigcontest_muju_festival_shuttle_bus
 
 - This is "od_yyyymmdd_1.csv" data (hereinafter, od data), which is OD data between administrative periods from 2023.9.1 to 2023.10.15. 
 
-![2](https://github.com/user-attachments/assets/e44e5ded-101d-41b3-854a-b168218f0764)
+<img src="https://github.com/user-attachments/assets/e44e5ded-101d-41b3-854a-b168218f0764" alt="6" width="1000"/>
+
 
 - The other is "stay_yyyymmdd_1.csv" data (hereinafter, stay data), which is the national administrative unit residence population data from 2023.09.01 to 2023.10.15.
 
-![3](https://github.com/user-attachments/assets/252f80c4-fe6f-4f4d-b855-fb3c05861e81)
+
+<img src="https://github.com/user-attachments/assets/252f80c4-fe6f-4f4d-b855-fb3c05861e81" alt="6" width="400"/>
 
 
 ## 2.2 Visitor analysis <a name="sec2p2"></a>
 ### 2.2.1 Result of analyzing the number of people who visited Muju during the festival by age group.
 
-![4](https://github.com/user-attachments/assets/e52569e5-3986-4c8a-b5b4-4abb33bd4002)
+<img src="https://github.com/user-attachments/assets/e52569e5-3986-4c8a-b5b4-4abb33bd4002" alt="6" width="500"/>
+
 
 From this summary we can say that:
 1. The percentage of visitors under 10s is the highest, followed by those in their 40s and 30s.
@@ -57,7 +60,8 @@ From this summary we can say that:
 
 ### 2.2.2 Result of analyzing the number of people who stayed Muju during the festival by age group.
 
-![5](https://github.com/user-attachments/assets/8db12ac8-a353-476e-bc65-42db7a8ffff6)
+<img src="https://github.com/user-attachments/assets/8db12ac8-a353-476e-bc65-42db7a8ffff6" alt="6" width="500"/>
+
 
 From this summary we can say that:
 1. The percentage of staying people 40s is the highest, followed by those in their 30s, under 10s and 30s.
@@ -65,7 +69,7 @@ From this summary we can say that:
 
 ### 2.2.3 Result of distribution of festival visitors' residence.
 
-![6](https://github.com/user-attachments/assets/90005650-4b72-47b4-8ba4-67fb707b6548)
+<img src="https://github.com/user-attachments/assets/90005650-4b72-47b4-8ba4-67fb707b6548" alt="6" width="600"/>
 
 From this summary we can say that:
 1. It can be seen that many visitors to the festival came from Jeonbuk and Chungnam/Daejeon.
@@ -75,7 +79,7 @@ From this summary we can say that:
 ## 2.3 Movement analysis <a name="sec2p3"></a>
 ### 2.3.1 Result of the distribution of travel distance to Muju by age group.
 
-![7](https://github.com/user-attachments/assets/2462d208-d46b-48e8-b3d2-49a5879d7f4a)
+<img src="https://github.com/user-attachments/assets/2462d208-d46b-48e8-b3d2-49a5879d7f4a" alt="6" width="500"/>
 
 From this summary we can say that:
 1. Those under 10s and 30s and 40s visit from various distances, ranging from close to far away.
@@ -83,7 +87,9 @@ From this summary we can say that:
 
 ### 2.3.2 The distribution of transportation used by festival visitors.
 
-![8](https://github.com/user-attachments/assets/95ff40ef-124d-4605-b6a5-0919d49191b5)
+<img src="https://github.com/user-attachments/assets/95ff40ef-124d-4605-b6a5-0919d49191b5" alt="6" width="500"/>
+
+
 
 From this summary we can say that:
 - With 39019 cases of car use, most visitors visited the festival by car.
@@ -100,7 +106,7 @@ From this summary we can say that:
 
 Departure | Travel Route | Time |
 ------------|---------------|-------|
-Seoul           | Seoul Station (KT) → Daejeon Station (City Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 2h 30m |
+Seoul           | Seoul Station (KTX) → Daejeon Station (City Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 2h 30m |
 Jeonju     | Jeonju Express Bus Terminal (Express Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 2h 30m |
 Daegu  | Daegu Station (Mugunghwa Train) → Yeongdong Station (City Bus) → Muju Bus Terminal | 2h 40m |
 Busan     | Busan Station (SRT) → Daejeon Station (City Bus) → Daejeon Complex Terminal (Intercity Bus) → Muju Bus Terminal | 3h |
@@ -111,7 +117,7 @@ From this summary we can say that:
 
 #### 3.1.2.2 This shows the contents of the festival by time and the last bus time from the festival site to each city
 
-![9](https://github.com/user-attachments/assets/0e0b0fe5-369e-47ed-a524-4ccb00366553)
+<img src="https://github.com/user-attachments/assets/0e0b0fe5-369e-47ed-a524-4ccb00366553" alt="6" width="800"/>
 
 From this summary we can say that:
 1. Bus services are limited to certain areas and time zones.
@@ -136,13 +142,12 @@ From this survey we can say that:
 ## 3.2 Proposition of shuttle bus <a name="sec3p2"></a>
 ### The need for a shuttle bus
 
-![10](https://github.com/user-attachments/assets/65fe38cf-c70a-4f3d-b382-0b4d94a7d783)
-
+<img src="https://github.com/user-attachments/assets/65fe38cf-c70a-4f3d-b382-0b4d94a7d783" alt="6" width="800"/>
 
 ## 3.3 Shuttle bus rimetable <a name="sec3p3"></a>
 ### 3.1.1 To Muju
 
-![11](https://github.com/user-attachments/assets/61cc4717-acba-4088-8a25-ed9ec87164d0)
+<img src="https://github.com/user-attachments/assets/61cc4717-acba-4088-8a25-ed9ec87164d0" alt="6" width="1000"/>
 
 Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 ------------|---------------|---------------|---------------|---------------|---------------|---------------|---------------|
@@ -156,7 +161,7 @@ Arrival | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 
 ### 3.1.2 To Return
 
-![12](https://github.com/user-attachments/assets/2136989f-1ba8-4fb5-9e2d-aa73b94ad058)
+<img src="https://github.com/user-attachments/assets/2136989f-1ba8-4fb5-9e2d-aa73b94ad058" alt="6" width="1000"/>
 
 Departure | Sat | Sun | Mon | Tue | Wed | Thu | Fri |
 ------------|---------------|---------------|---------------|---------------|---------------|---------------|---------------|
@@ -277,11 +282,11 @@ def number_of_visitors_to_Muju_by_region():
 ```
 ### 3.4.2 Linear Programing
 
-![13](https://github.com/user-attachments/assets/bc839c99-8357-4ad6-818d-61001801cd1e)
+<img src="https://github.com/user-attachments/assets/bc839c99-8357-4ad6-818d-61001801cd1e" alt="6" width="600"/>
 
 ### 3.4.3 Flow Chart
 
-<img src="https://github.com/user-attachments/assets/bee931c6-d9a3-4301-88dd-f6c2de8b52f6" alt="14" width="500"/>
+<img src="https://github.com/user-attachments/assets/bee931c6-d9a3-4301-88dd-f6c2de8b52f6" alt="14" width="600"/>
 
 ### 3.4.4 Recommended Route
 #### 3.4.4.1 Daejeon Line
@@ -299,7 +304,7 @@ Rank | Travel Route | Distance | Time |
 
 #### 3.4.4.2 Jeonbuk Line
 
-![16](https://github.com/user-attachments/assets/9c351b94-87bd-4f06-89de-a621f191e217)
+<img src="https://github.com/user-attachments/assets/9c351b94-87bd-4f06-89de-a621f191e217" alt="6" width="800"/>
 
 Rank | Travel Route | Distance | Time |
 ------------|---------------|---------------|---------------|

From 32d8460860d4a81b747951175287955603b011b6 Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 21:18:48 +0900
Subject: [PATCH 08/10] refactor #15 delete EDA/chanwoong/festival_map.html

---
 EDA/chanwoong/festival_map.html | 606 --------------------------------
 1 file changed, 606 deletions(-)
 delete mode 100644 EDA/chanwoong/festival_map.html

diff --git a/EDA/chanwoong/festival_map.html b/EDA/chanwoong/festival_map.html
deleted file mode 100644
index 7c79880..0000000
--- a/EDA/chanwoong/festival_map.html
+++ /dev/null
@@ -1,606 +0,0 @@
-<!DOCTYPE html>
-<html>
-<head>
-    
-    <meta http-equiv="content-type" content="text/html; charset=UTF-8" />
-    
-        <script>
-            L_NO_TOUCH = false;
-            L_DISABLE_3D = false;
-        </script>
-    
-    <style>html, body {width: 100%;height: 100%;margin: 0;padding: 0;}</style>
-    <style>#map {position:absolute;top:0;bottom:0;right:0;left:0;}</style>
-    <script src="https://cdn.jsdelivr.net/npm/leaflet@1.9.3/dist/leaflet.js"></script>
-    <script src="https://code.jquery.com/jquery-3.7.1.min.js"></script>
-    <script src="https://cdn.jsdelivr.net/npm/bootstrap@5.2.2/dist/js/bootstrap.bundle.min.js"></script>
-    <script src="https://cdnjs.cloudflare.com/ajax/libs/Leaflet.awesome-markers/2.0.2/leaflet.awesome-markers.js"></script>
-    <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/leaflet@1.9.3/dist/leaflet.css"/>
-    <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/bootstrap@5.2.2/dist/css/bootstrap.min.css"/>
-    <link rel="stylesheet" href="https://netdna.bootstrapcdn.com/bootstrap/3.0.0/css/bootstrap-glyphicons.css"/>
-    <link rel="stylesheet" href="https://cdn.jsdelivr.net/npm/@fortawesome/fontawesome-free@6.2.0/css/all.min.css"/>
-    <link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/Leaflet.awesome-markers/2.0.2/leaflet.awesome-markers.css"/>
-    <link rel="stylesheet" href="https://cdn.jsdelivr.net/gh/python-visualization/folium/folium/templates/leaflet.awesome.rotate.min.css"/>
-    
-            <meta name="viewport" content="width=device-width,
-                initial-scale=1.0, maximum-scale=1.0, user-scalable=no" />
-            <style>
-                #map_3b1e3893d9ab7f234de918b79096262a {
-                    position: relative;
-                    width: 100.0%;
-                    height: 100.0%;
-                    left: 0.0%;
-                    top: 0.0%;
-                }
-                .leaflet-container { font-size: 1rem; }
-            </style>
-        
-</head>
-<body>
-    
-    
-            <div class="folium-map" id="map_3b1e3893d9ab7f234de918b79096262a" ></div>
-        
-</body>
-<script>
-    
-    
-            var map_3b1e3893d9ab7f234de918b79096262a = L.map(
-                "map_3b1e3893d9ab7f234de918b79096262a",
-                {
-                    center: [36.007, 127.6609],
-                    crs: L.CRS.EPSG3857,
-                    zoom: 9,
-                    zoomControl: true,
-                    preferCanvas: false,
-                }
-            );
-
-            
-
-        
-    
-            var tile_layer_a9f76908ca16c75d49d8bbc87c02de39 = L.tileLayer(
-                "https://tile.openstreetmap.org/{z}/{x}/{y}.png",
-                {"attribution": "\u0026copy; \u003ca href=\"https://www.openstreetmap.org/copyright\"\u003eOpenStreetMap\u003c/a\u003e contributors", "detectRetina": false, "maxNativeZoom": 19, "maxZoom": 19, "minZoom": 0, "noWrap": false, "opacity": 1, "subdomains": "abc", "tms": false}
-            );
-        
-    
-            tile_layer_a9f76908ca16c75d49d8bbc87c02de39.addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var marker_3ffe4c2725b88a29702fa8503c59a250 = L.marker(
-                [36.007, 127.6609],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_c0fbc86a5df6c582562aee0042b838d6 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "red", "prefix": "glyphicon"}
-            );
-            marker_3ffe4c2725b88a29702fa8503c59a250.setIcon(icon_c0fbc86a5df6c582562aee0042b838d6);
-        
-    
-        var popup_78633ddb6117636d89991e8d43912628 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_340210a508397e4bb0fa91397a615387 = $(`<div id="html_340210a508397e4bb0fa91397a615387" style="width: 100.0%; height: 100.0%;">축제 개최지</div>`)[0];
-                popup_78633ddb6117636d89991e8d43912628.setContent(html_340210a508397e4bb0fa91397a615387);
-            
-        
-
-        marker_3ffe4c2725b88a29702fa8503c59a250.bindPopup(popup_78633ddb6117636d89991e8d43912628)
-        ;
-
-        
-    
-    
-            var circle_bd44ad58502fc4a135008501a6a28a54 = L.circle(
-                [36.007, 127.6609],
-                {"bubblingMouseEvents": true, "color": "crimson", "dashArray": null, "dashOffset": null, "fill": true, "fillColor": "crimson", "fillOpacity": 0.2, "fillRule": "evenodd", "lineCap": "round", "lineJoin": "round", "opacity": 1.0, "radius": 50000, "stroke": true, "weight": 3}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-        var popup_d14582fdf91af2c56c4e0895eb072f17 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_7e5b98ef6ff1579bb7d95d5531bab954 = $(`<div id="html_7e5b98ef6ff1579bb7d95d5531bab954" style="width: 100.0%; height: 100.0%;">100km 반경</div>`)[0];
-                popup_d14582fdf91af2c56c4e0895eb072f17.setContent(html_7e5b98ef6ff1579bb7d95d5531bab954);
-            
-        
-
-        circle_bd44ad58502fc4a135008501a6a28a54.bindPopup(popup_d14582fdf91af2c56c4e0895eb072f17)
-        ;
-
-        
-    
-    
-            var marker_6d2087a98bb18b74510ec7f7c202ef68 = L.marker(
-                [35.8475612, 127.1176723],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_23b4b5c0f0b4bcfbd0b9c7ed1971b716 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_6d2087a98bb18b74510ec7f7c202ef68.setIcon(icon_23b4b5c0f0b4bcfbd0b9c7ed1971b716);
-        
-    
-        var popup_8db7aad026f5171ef7710515a7faa44f = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_3ec5603437f75ca73b1d24f458fe719d = $(`<div id="html_3ec5603437f75ca73b1d24f458fe719d" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     전라북도 <br>전주시 덕진구<br>                     4511300000                   </div></div>`)[0];
-                popup_8db7aad026f5171ef7710515a7faa44f.setContent(html_3ec5603437f75ca73b1d24f458fe719d);
-            
-        
-
-        marker_6d2087a98bb18b74510ec7f7c202ef68.bindPopup(popup_8db7aad026f5171ef7710515a7faa44f)
-        ;
-
-        
-    
-    
-            var marker_21dc6ddf96abca3971f08aaae814871d = L.marker(
-                [36.3551786, 127.3838487],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_68311d5c9290a3ca42997d535b758b64 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_21dc6ddf96abca3971f08aaae814871d.setIcon(icon_68311d5c9290a3ca42997d535b758b64);
-        
-    
-        var popup_6e2b8f36f8e7a44d1e2f9dcafb2739a0 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_c874ffcfabc91bb894b5c3601df7a820 = $(`<div id="html_c874ffcfabc91bb894b5c3601df7a820" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     대전광역시 <br>서구<br>                     3017000000                   </div></div>`)[0];
-                popup_6e2b8f36f8e7a44d1e2f9dcafb2739a0.setContent(html_c874ffcfabc91bb894b5c3601df7a820);
-            
-        
-
-        marker_21dc6ddf96abca3971f08aaae814871d.bindPopup(popup_6e2b8f36f8e7a44d1e2f9dcafb2739a0)
-        ;
-
-        
-    
-    
-            var marker_00c8481935e7c2f2be0be4c840e809c0 = L.marker(
-                [35.7955115, 127.132447],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_993fb64b1da07c7252a92f67d492fd9f = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_00c8481935e7c2f2be0be4c840e809c0.setIcon(icon_993fb64b1da07c7252a92f67d492fd9f);
-        
-    
-        var popup_5b7b55d40e5d96aee985c9b9fdba5226 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_06dba669531a72819fb6ccd7d97b2c22 = $(`<div id="html_06dba669531a72819fb6ccd7d97b2c22" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     전라북도 <br>전주시 완산구<br>                     4511100000                   </div></div>`)[0];
-                popup_5b7b55d40e5d96aee985c9b9fdba5226.setContent(html_06dba669531a72819fb6ccd7d97b2c22);
-            
-        
-
-        marker_00c8481935e7c2f2be0be4c840e809c0.bindPopup(popup_5b7b55d40e5d96aee985c9b9fdba5226)
-        ;
-
-        
-    
-    
-            var marker_92913b7d12b7679f7f33c0f223e32445 = L.marker(
-                [36.3620732, 127.3564099],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_b5a691efb792c947ac858f915a77e507 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_92913b7d12b7679f7f33c0f223e32445.setIcon(icon_b5a691efb792c947ac858f915a77e507);
-        
-    
-        var popup_28caa0c61ddc2eda2e0678e815691d72 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_1064e33651a747769233fc9f2d57a13d = $(`<div id="html_1064e33651a747769233fc9f2d57a13d" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     대전광역시 <br>유성구<br>                     3020000000                   </div></div>`)[0];
-                popup_28caa0c61ddc2eda2e0678e815691d72.setContent(html_1064e33651a747769233fc9f2d57a13d);
-            
-        
-
-        marker_92913b7d12b7679f7f33c0f223e32445.bindPopup(popup_28caa0c61ddc2eda2e0678e815691d72)
-        ;
-
-        
-    
-    
-            var marker_72360d4539e456c6ec8b4794c47a42d9 = L.marker(
-                [35.8865, 128.6355],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_ce8f7d1623f13f792bd4152450f4be7b = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_72360d4539e456c6ec8b4794c47a42d9.setIcon(icon_ce8f7d1623f13f792bd4152450f4be7b);
-        
-    
-        var popup_02a667fa6099afa678fad2bed75b484e = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_9e212e240bc05f910830b22df63aee66 = $(`<div id="html_9e212e240bc05f910830b22df63aee66" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     대전광역시 <br>동구<br>                     3011000000                   </div></div>`)[0];
-                popup_02a667fa6099afa678fad2bed75b484e.setContent(html_9e212e240bc05f910830b22df63aee66);
-            
-        
-
-        marker_72360d4539e456c6ec8b4794c47a42d9.bindPopup(popup_02a667fa6099afa678fad2bed75b484e)
-        ;
-
-        
-    
-    
-            var marker_ff974759cd7a5f2042a005ffad8947b6 = L.marker(
-                [35.647, 127.5212],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_aac857f9876ba978dc8ad64c7c1b6b3b = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_ff974759cd7a5f2042a005ffad8947b6.setIcon(icon_aac857f9876ba978dc8ad64c7c1b6b3b);
-        
-    
-        var popup_646d6e4b2c16ee217350ff5646bc89cf = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_939003931d17ea6f90356a01285bde9b = $(`<div id="html_939003931d17ea6f90356a01285bde9b" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     전라북도 <br>장수군<br>                     4574000000                   </div></div>`)[0];
-                popup_646d6e4b2c16ee217350ff5646bc89cf.setContent(html_939003931d17ea6f90356a01285bde9b);
-            
-        
-
-        marker_ff974759cd7a5f2042a005ffad8947b6.bindPopup(popup_646d6e4b2c16ee217350ff5646bc89cf)
-        ;
-
-        
-    
-    
-            var marker_a208b6f951c69ecfbbf0510673ca74c8 = L.marker(
-                [36.1747588, 127.7834432],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_2c13be16282b64194072a87246015bc5 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "blue", "prefix": "glyphicon"}
-            );
-            marker_a208b6f951c69ecfbbf0510673ca74c8.setIcon(icon_2c13be16282b64194072a87246015bc5);
-        
-    
-        var popup_92e3f1cc3a68d1f2e1b29e45c6fca9f4 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_61141e6f097cd92a714d4c584ffc4914 = $(`<div id="html_61141e6f097cd92a714d4c584ffc4914" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     충청북도 <br>영동군<br>                     4374000000                   </div></div>`)[0];
-                popup_92e3f1cc3a68d1f2e1b29e45c6fca9f4.setContent(html_61141e6f097cd92a714d4c584ffc4914);
-            
-        
-
-        marker_a208b6f951c69ecfbbf0510673ca74c8.bindPopup(popup_92e3f1cc3a68d1f2e1b29e45c6fca9f4)
-        ;
-
-        
-    
-    
-            var marker_9cf43cf62a3d1c2f35f573dab4f7d627 = L.marker(
-                [36.4799999, 127.289],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_ed63225266af3268b4b48046c41cdfc2 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_9cf43cf62a3d1c2f35f573dab4f7d627.setIcon(icon_ed63225266af3268b4b48046c41cdfc2);
-        
-    
-        var popup_cc723436b36dad3f44dec258d6dddb57 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_6f86a0a4311ae02961d18ea1b40021cd = $(`<div id="html_6f86a0a4311ae02961d18ea1b40021cd" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     세종특별자치시<br> nan<br>                     3600000000                   </div></div>`)[0];
-                popup_cc723436b36dad3f44dec258d6dddb57.setContent(html_6f86a0a4311ae02961d18ea1b40021cd);
-            
-        
-
-        marker_9cf43cf62a3d1c2f35f573dab4f7d627.bindPopup(popup_cc723436b36dad3f44dec258d6dddb57)
-        ;
-
-        
-    
-    
-            var marker_c55d4b2b3a4a80f40e81e6350b74b820 = L.marker(
-                [36.1087249, 127.48815],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_e2acd7bc320c57630b73f648263ae7f0 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_c55d4b2b3a4a80f40e81e6350b74b820.setIcon(icon_e2acd7bc320c57630b73f648263ae7f0);
-        
-    
-        var popup_f83575203489b4b0529bcef25873dc1d = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_632978578212a9199fc23d0ad6b86bad = $(`<div id="html_632978578212a9199fc23d0ad6b86bad" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     충청남도<br> 금산군<br>                     4471000000                   </div></div>`)[0];
-                popup_f83575203489b4b0529bcef25873dc1d.setContent(html_632978578212a9199fc23d0ad6b86bad);
-            
-        
-
-        marker_c55d4b2b3a4a80f40e81e6350b74b820.bindPopup(popup_f83575203489b4b0529bcef25873dc1d)
-        ;
-
-        
-    
-    
-            var marker_0d9c45e2863c1497c8201e2fdb7d0108 = L.marker(
-                [36.3254513, 127.4211737],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_189952031fb5c4d577295d1b80cb9609 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_0d9c45e2863c1497c8201e2fdb7d0108.setIcon(icon_189952031fb5c4d577295d1b80cb9609);
-        
-    
-        var popup_3e86d91cd1b8ee8ee7a0ddcee7be8281 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_4773a1f5d638e71a1655b5d98dbf6e9e = $(`<div id="html_4773a1f5d638e71a1655b5d98dbf6e9e" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     대전광역시<br> 중구<br>                     3014000000                   </div></div>`)[0];
-                popup_3e86d91cd1b8ee8ee7a0ddcee7be8281.setContent(html_4773a1f5d638e71a1655b5d98dbf6e9e);
-            
-        
-
-        marker_0d9c45e2863c1497c8201e2fdb7d0108.bindPopup(popup_3e86d91cd1b8ee8ee7a0ddcee7be8281)
-        ;
-
-        
-    
-    
-            var marker_39c12fafccdb1217b13e3b639c4ee952 = L.marker(
-                [35.9479291, 126.9578025],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_d9e0c39d1a9048192bd38e798b748514 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_39c12fafccdb1217b13e3b639c4ee952.setIcon(icon_d9e0c39d1a9048192bd38e798b748514);
-        
-    
-        var popup_3db9b2d0386a8fd086d43f40a1191939 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_75a2006031b93aa360bb77b3f0edb4f0 = $(`<div id="html_75a2006031b93aa360bb77b3f0edb4f0" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     전라북도<br> 익산시<br>                     4514000000                   </div></div>`)[0];
-                popup_3db9b2d0386a8fd086d43f40a1191939.setContent(html_75a2006031b93aa360bb77b3f0edb4f0);
-            
-        
-
-        marker_39c12fafccdb1217b13e3b639c4ee952.bindPopup(popup_3db9b2d0386a8fd086d43f40a1191939)
-        ;
-
-        
-    
-    
-            var marker_859a3aea9aee8f3b8be9b56deacb102b = L.marker(
-                [35.7915, 127.4252],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_513359c2e0bc855d96f2c808446e305b = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_859a3aea9aee8f3b8be9b56deacb102b.setIcon(icon_513359c2e0bc855d96f2c808446e305b);
-        
-    
-        var popup_8c80641ee262457109ad15f0b40fd35d = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_531631becd5f45588d58c39ee33f15f1 = $(`<div id="html_531631becd5f45588d58c39ee33f15f1" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     전라북도<br> 진안군<br>                     4572000000                   </div></div>`)[0];
-                popup_8c80641ee262457109ad15f0b40fd35d.setContent(html_531631becd5f45588d58c39ee33f15f1);
-            
-        
-
-        marker_859a3aea9aee8f3b8be9b56deacb102b.bindPopup(popup_8c80641ee262457109ad15f0b40fd35d)
-        ;
-
-        
-    
-    
-            var marker_683b2cd4794bdcbc8ec2c811d17964ea = L.marker(
-                [35.6872871, 127.9138505],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_f1ec2129d107e38549693024a23bf13f = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_683b2cd4794bdcbc8ec2c811d17964ea.setIcon(icon_f1ec2129d107e38549693024a23bf13f);
-        
-    
-        var popup_8057d11370a2e7e333218c08ba149781 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_5789df2340ee413d54ab636b2f8e7244 = $(`<div id="html_5789df2340ee413d54ab636b2f8e7244" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     경상남도<br> 거창군<br>                     4888000000                   </div></div>`)[0];
-                popup_8057d11370a2e7e333218c08ba149781.setContent(html_5789df2340ee413d54ab636b2f8e7244);
-            
-        
-
-        marker_683b2cd4794bdcbc8ec2c811d17964ea.bindPopup(popup_8057d11370a2e7e333218c08ba149781)
-        ;
-
-        
-    
-    
-            var marker_bd0918136d98d2d2575004bab3a4f566 = L.marker(
-                [35.8297006, 128.533094],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_4fd3ca64dc86f962adeb3356850e364a = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_bd0918136d98d2d2575004bab3a4f566.setIcon(icon_4fd3ca64dc86f962adeb3356850e364a);
-        
-    
-        var popup_115e767ec4daa2d7591b18ada7b07161 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_a09121e537e794929f6399d910077289 = $(`<div id="html_a09121e537e794929f6399d910077289" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     대구광역시<br> 달서구<br>                     2729000000                   </div></div>`)[0];
-                popup_115e767ec4daa2d7591b18ada7b07161.setContent(html_a09121e537e794929f6399d910077289);
-            
-        
-
-        marker_bd0918136d98d2d2575004bab3a4f566.bindPopup(popup_115e767ec4daa2d7591b18ada7b07161)
-        ;
-
-        
-    
-    
-            var marker_d3cc52491a65fe0cd5156fe197f9b678 = L.marker(
-                [35.9679984, 126.7369036],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_cec349988b677025da984dc8f1145941 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_d3cc52491a65fe0cd5156fe197f9b678.setIcon(icon_cec349988b677025da984dc8f1145941);
-        
-    
-        var popup_ad50a79cbbd1ddcc0c685894a6351b76 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_23d839c17f6ff53ab4fdbbdd914d2bf4 = $(`<div id="html_23d839c17f6ff53ab4fdbbdd914d2bf4" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     전라북도<br> 군산시<br>                     4513000000                   </div></div>`)[0];
-                popup_ad50a79cbbd1ddcc0c685894a6351b76.setContent(html_23d839c17f6ff53ab4fdbbdd914d2bf4);
-            
-        
-
-        marker_d3cc52491a65fe0cd5156fe197f9b678.bindPopup(popup_ad50a79cbbd1ddcc0c685894a6351b76)
-        ;
-
-        
-    
-    
-            var marker_a5cd8cc05e58653bd377fd347294ae0c = L.marker(
-                [36.1398035, 128.1139534],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_6baec664180283402348fec163c44356 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_a5cd8cc05e58653bd377fd347294ae0c.setIcon(icon_6baec664180283402348fec163c44356);
-        
-    
-        var popup_203714487a8bdc3290af53cbb226ad91 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_45a98883fd39de3aa13b76cb5b746dcb = $(`<div id="html_45a98883fd39de3aa13b76cb5b746dcb" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     경상북도<br> 김천시<br>                     4715000000                   </div></div>`)[0];
-                popup_203714487a8bdc3290af53cbb226ad91.setContent(html_45a98883fd39de3aa13b76cb5b746dcb);
-            
-        
-
-        marker_a5cd8cc05e58653bd377fd347294ae0c.bindPopup(popup_203714487a8bdc3290af53cbb226ad91)
-        ;
-
-        
-    
-    
-            var marker_f71e2b07ab73b40c680891a49b4d35ee = L.marker(
-                [36.3465766, 127.4158483],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_80dbd3c838db65218161960c1e2d847b = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_f71e2b07ab73b40c680891a49b4d35ee.setIcon(icon_80dbd3c838db65218161960c1e2d847b);
-        
-    
-        var popup_9f08624ff6c896cf7326d51c441a48e0 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_d7531c8aa5d36956d74e2c1202012b58 = $(`<div id="html_d7531c8aa5d36956d74e2c1202012b58" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     대전광역시<br> 대덕구<br>                     3023000000                   </div></div>`)[0];
-                popup_9f08624ff6c896cf7326d51c441a48e0.setContent(html_d7531c8aa5d36956d74e2c1202012b58);
-            
-        
-
-        marker_f71e2b07ab73b40c680891a49b4d35ee.bindPopup(popup_9f08624ff6c896cf7326d51c441a48e0)
-        ;
-
-        
-    
-    
-            var marker_042c687a8178470c860eeb8798d86de7 = L.marker(
-                [35.1803459, 128.1079953],
-                {}
-            ).addTo(map_3b1e3893d9ab7f234de918b79096262a);
-        
-    
-            var icon_6bf3ce81291a995728d72f51cb9a7096 = L.AwesomeMarkers.icon(
-                {"extraClasses": "fa-rotate-0", "icon": "info-sign", "iconColor": "white", "markerColor": "lightblue", "prefix": "glyphicon"}
-            );
-            marker_042c687a8178470c860eeb8798d86de7.setIcon(icon_6bf3ce81291a995728d72f51cb9a7096);
-        
-    
-        var popup_ad8864f12e7de6c4bfa5d838a7899940 = L.popup({"maxWidth": "100%"});
-
-        
-            
-                var html_8d43a9524b624021d862f4f46bf276ee = $(`<div id="html_8d43a9524b624021d862f4f46bf276ee" style="width: 100.0%; height: 100.0%;"><div style="font-size:12px;">                     경상남도<br> 진주시<br>                     4817000000                   </div></div>`)[0];
-                popup_ad8864f12e7de6c4bfa5d838a7899940.setContent(html_8d43a9524b624021d862f4f46bf276ee);
-            
-        
-
-        marker_042c687a8178470c860eeb8798d86de7.bindPopup(popup_ad8864f12e7de6c4bfa5d838a7899940)
-        ;
-
-        
-    
-</script>
-</html>
\ No newline at end of file

From 4ef85fff8f74c67fd2c50d93776a7eaa3c97c50b Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Sun, 3 Nov 2024 21:24:05 +0900
Subject: [PATCH 09/10] refactor #15: update .gitattributes

---
 .gitattributes | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/.gitattributes b/.gitattributes
index eedbb7f..7466880 100644
--- a/.gitattributes
+++ b/.gitattributes
@@ -1 +1,2 @@
-*.ipynb -linguist-detectable
\ No newline at end of file
+*.ipynb -linguist-detectable
+*.html -linguist-detectable

From 25c3fffb1e344da606fcb35385546d96a3b191ad Mon Sep 17 00:00:00 2001
From: Chanwoong Hwang <44831566+NongShiN@users.noreply.github.com>
Date: Tue, 5 Nov 2024 13:35:43 +0900
Subject: [PATCH 10/10] docs #16: update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 83ed6d4..5408d7e 100644
--- a/README.md
+++ b/README.md
@@ -142,7 +142,7 @@ From this survey we can say that:
 ## 3.2 Proposition of shuttle bus <a name="sec3p2"></a>
 ### The need for a shuttle bus
 
-<img src="https://github.com/user-attachments/assets/65fe38cf-c70a-4f3d-b382-0b4d94a7d783" alt="6" width="800"/>
+<img src="https://github.com/user-attachments/assets/8250e5f7-aa5f-40a9-a08c-8b6d08e7002f" alt="6" width="800"/>
 
 ## 3.3 Shuttle bus rimetable <a name="sec3p3"></a>
 ### 3.1.1 To Muju