segmentation

jeonghunyoon · jeonghunyoon · commit f6f11c2aa21b · 2019-06-05T02:33:00.000+09:00
diff --git a/Lecture20_Customer_Segmentation_Easy_version.ipynb b/Lecture20_Customer_Segmentation_Easy_version.ipynb
@@ -1,5 +1,25 @@
 {
  "cells": [
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# Customer segmentation using clustering an classification (Simple)\n",
+    "\n",
+    "https://archive.ics.uci.edu/ml/datasets/online+retail : \n",
+    "\n",
+    "&#51060; &#45936;&#51060;&#53552;&#45716; 2010&#45380; 1&#50900; 20&#51068; &#48512;&#53552; 2011&#45380; 9&#50900; 12&#51068; &#44620;&#51648;&#51032; &#44592;&#44036;&#46041;&#50504; &#50728;&#46972;&#51064; &#51204;&#51088;&#49345;&#44144;&#47000; &#54924;&#49324;&#50640;&#49436; &#48156;&#49373;&#54620; transaction&#50640; &#45824;&#54620; &#45236;&#50857;&#51077;&#45768;&#45796;. &#54644;&#45817; &#54924;&#49324;&#45716; UK-based non-store online &#51204;&#51088;&#49345;&#44144;&#47000; &#54924;&#49324;&#51077;&#45768;&#45796;.\n",
+    "\n",
+    "&#51060; &#45936;&#51060;&#53552;&#47484; &#51060;&#50857;&#54616;&#50668;, &#49324;&#50857;&#51088;&#51032; &#54665;&#46041;&#51012; &#48516;&#49437;&#54633;&#45768;&#45796;. &#49324;&#50857;&#51088;&#51032; &#54665;&#46041;&#51012; &#48516;&#49437;&#54616;&#50668;, &#49324;&#50857;&#51088;&#51032; &#54665;&#46041;&#51012; &#50696;&#52769;&#54616;&#45716; &#47784;&#45944;&#51012; &#47564;&#46308;&#44192;&#49845;&#45768;&#45796;."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# 1. &#45936;&#51060;&#53552; &#47196;&#46377;"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 2,
@@ -171,6 +191,13 @@
     "dataset.head()"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# 2. &#45936;&#51060;&#53552; &#51204;&#52376;&#47532;"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 4,
@@ -482,7 +509,14 @@
    "source": [
     "# 4339&#47749;&#51032; transaction&#51012; &#44032;&#51648;&#44256; &#51080;&#45796;.\n",
     "df_customerid_groups=dataset.groupby(\"CustomerID\")\n",
-    "print(len((df_customerid_groups.groups)))"
+    "print(len(df_customerid_groups.groups))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "# 3. Clustering"
    ]
   },
   {
@@ -590,6 +624,14 @@
     "df_cluster.head()"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "- Quantity\n",
+    "- UnitPrice"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 26,
@@ -617,19 +659,32 @@
     "x"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> Feature Scaling"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 27,
    "metadata": {},
    "outputs": [],
    "source": [
-    "# Feature Scaling\n",
     "from sklearn.preprocessing import StandardScaler\n",
     "\n",
     "sc_x = StandardScaler()\n",
     "x = sc_x.fit_transform(x)"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> K-means"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 28,
@@ -681,6 +736,13 @@
     "plt.ylabel('With in cluster sum of squers(WCSS)')"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "> &#49884;&#44033;&#54868;"
+   ]
+  },
   {
    "cell_type": "code",
    "execution_count": 30,
@@ -804,7 +866,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.6.3"
+   "version": "3.6.7"
   }
  },
  "nbformat": 4,