Update README to reflect new Vagrantfile

jet2007 · Mar 18, 2016 · 7015d67
1 parent 1c3b838
commit 7015d67
Showing 1 changed file with 37 additions and 116 deletions.
diff --git a/vagrant/README.md b/vagrant/README.md
@@ -8,17 +8,17 @@ the original version of this document. Alexey Grishchenko (@0x0FFF) has
 also participated in improvement of the document and scripts.
 
 ## Who should read this document?
-Any one who wants to develop code for GPDB. This guide targets the
-free-lance developer who typically has a laptop and wants to develop
+Anyone who wants to develop code for GPDB. This guide targets the
+freelance developer who typically has a laptop and wants to develop
 GPDB code on it. In other words, such a typical developer does not necessarily
 have 24x7 access to a cluster, and needs a minimal stand-alone development
 environment.
 
-The instructions here were first written and verified on OSX El Capitan,
-v.10.11.2. In the future, we hope to add to the list below.
+The instructions here were verified on the configurations below.
 
 | **OS**        | **Date Tested**    | **Comments**                           |
 | :------------ |:-------------------| --------------------------------------:|
+| OSX v.10.10.5 | 2016-03-17         | Vagrant v. 1.8.1; VirtualBox v. 5.0.16 |
 | OSX v.10.11.2 | 2015-12-29         | Vagrant v. 1.8.1; VirtualBox v. 5.0.12 |
 
 ## 1: Setup VirtualBox and Vagrant
@@ -48,33 +48,29 @@ Next let us start a virtual machine using the Vagrant file in that directory.
 From the terminal window, issue the following command:
 
 ```shell
-vagrant up
+vagrant up gpdb
 ```
 
 The last command will take a while as Vagrant works with VirtualBox to fetch
-a box image for CentOS. The box image is many hundred MiBs is size, so it takes
-a while. This image is fetched only once and will be stored by vagrant in a
-directory (likely `~/.vagrant.d/boxes/`), so you won't incur this network IO
-if you repeat the steps above. A side-effect is that vagrant has now used a
+a box image for CentOS. This image is fetched only once and will be stored by Vagrant in a
+directory (likely `~/.vagrant.d/boxes/`), so you won't repeatedly incur this network IO
+if you repeat the steps above. A side-effect is that Vagrant has now used a
 few hundred MiBs of space on your machine. You can see the list of boxes that
-vagrant has downloaded using ``vagrant box list``. If you need to drop some
+Vagrant has downloaded using ``vagrant box list``. If you need to drop some
 box images, follow the instructions posted [here](https://docs.vagrantup.com/v2/cli/box.html "vagrant manage boxes").
 
 If you are curious about what Vagrant is doing, then open the file
 `Vagrantfile`. The `config.vm.box` parameter there specifies the
-vagrant box image that is being feteched. Essentially you are creating an
+Vagrant box image that is being fetched. Essentially you are creating an
 image of CentOS on your machine that will be used below to set up and run GPDB.
 
 While you are viewing the Vagrantfile, a few more things to notice here are:
 * The parameter `vb.memory` sets the memory to 8GB for the virtual machine.
   You could dial that number up or down depending on the actual memory in
   your machine.
 * The parameter `vb.cpus` sets the number of cores that the virtual machine will
-  use to 2. Again, feel free to change this number based on the machine that
+  use to 4. Again, feel free to change this number based on the machine that
   you have.
-* Notice the parameter `config.vm.synced_folder`. This configuration requests
-  that the code you checked out is mounted as `/gpdb` in the virtual machine.
-  More on this later below.
 * Additional synced folders can be configured by adding a `vagrant-local.yml`
   configuration file in the following format:
 
@@ -86,55 +82,50 @@ synced_folder:
       shared: /another/folder/in/vagrant
 ```
 
-Once the command above (`vagrant up`) returns, we are ready to login to the
+Once the command above (`vagrant up gpdb`) returns, we are ready to log in to the
 virtual machine. Type in the following command into the terminal window
 (make sure that you are in the directory `gpdb/vagrant/centos`):
 
 ```shell
-vagrant ssh
+vagrant ssh gpdb
 ```
 
 Now you are in the virtual machine shell in a **guest** OS that is running in
 your actual machine (the **host**). Everything that you do in the guest machine
-will be isolated from the host, except for any changes that you make to
-``/gpdb`` - recall that we requested that the code we checked out show up at
-this mount point in the virtual machine. Thus, if you type ``ls /gpdb`` in the
-virtual machine shell, then you will see the GPDB code that you checked out.
+will be isolated from the host.
 
-That's it - GPDB is built, up and running. You can open database connection right
-away with ``psql -d template1`` and start creating your stuff, or run the
-command ``make installcheck-good`` in ``/gpdb`` directory to run Greenplum tests
+That's it - GPDB is built, up and running.  Before you can open a psql connection, run the following:
+```shell
+# setup the environment
+source /usr/local/gpdb/greenplum_path.sh
+source ~/gpdb/gpAux/gpdemo/gpdemo-env.sh
+
+# create a database to interact with (you only need to do this once)
+createdb
+
+# connect!
+psql
+```
+
+To run the tests:
+```shell
+cd ~/gpdb
+make installcheck-good
+```
 
 If you are curious how this happened, take a look at the following scripts:
 * `vagrant/centos/vagrant-setup.sh` - this script installs all the packages
   required for GPDB as dependencies
 * `vagrant/centos/vagrant-build.sh` - this script builds GPDB. In case you
   need to change build options you can change this file and re-create VM by
-  running `vagrant destroy` followed by `vagrant up`
+  running `vagrant destroy gpdb` followed by `vagrant up gpdb`
 * `vagrant/centos/vagrant-configure-os.sh` - this script configures OS
   parameters required for running GPDB
-* `vagrant/centos/vagrant-install-inner.sh` - this script is responsible for
-  GPDB installation
 
 You can easily go to `vagrant/centos/Vagrantfile` and comment out the calls for
 any of these scripts at any time to prevent GPDB installation or OS-level
 configurations.
 
-Now, you are ready to start GPDB. To do that, type in the following commands
-into the (guest) terminal:
-```shell
-DBNAME=$USER
-createdb $DBNAME
-psql $DBNAME
-```
-
-You should see a psql prompt that says `vagrant=#`. At this point, you can open
-up another shell from the host OS. Start another *host* terminal, and go to
-the vagrant directory by typing `cd gpdb/vagrant`. Then, type in `vagrant ssh`.
-From this second guest terminal, you can run GPDB commands like `gpstate`.
-Go ahead and try it. You should see a report that states that you have a master
-and two segments.
-
 If you want to try out a few SQL commands, go back to the guest shell in which
 you have the `psql` prompt, and issue the following SQL commands:
 
@@ -170,7 +161,7 @@ ORDER BY M.ptime;
 ```
 
 You just created a simple warehouse database that simulates users posting
-messages on a social media network. The "fact" table (aka. the `Messages`
+messages on a social media network. The "fact" table (i.e. the `Messages`
 table) has a million rows. The final query reports the number of messages
 that were posted on each day. Pretty cool!
 
@@ -186,7 +177,7 @@ like: ![Postgres processes](/vagrant/pictures/gpdb_processes.png)
 
 (You may have to click on the image to see it at a higher resolution.)
 Here the key processes are the ones that were started as
-`/gpdb/vagrant/install/bin/postgres`. The master is the process (pid 25486
+`/usr/local/gpdb/bin/postgres`. The master is the process (pid 25486
 in the picture above) that has the word "master" in the `-D` parameter setting,
 whereas the segment hosts have the word "gpseg" in the `-D` parameter setting.
 
@@ -201,75 +192,5 @@ attach 25486
 Of course, you can change which function you want to break into, and change
 whether you want to debug the master or the segment processes. Happy hacking!
 
-##5: Want a larger cluster to play around with?
-If you want to play around with a larger cluster, then you can spin up the
-`gpdemo` cluster that is described in the
-[README.md]("https://github.com/greenplum-db/gpdb/blob/master/README.md") file. But, there a few things
-that you will have to do differently. Here are the steps.
-
-First, we will need to spin down the GPDB server that we just started above,
-as the setup for the demo has a different configuration. Issue the following
-command from your Vagrant shell:
-
-```shell
-gpstop
-```
-
-When prompted for a response, type in "Y".
-
-The demo cluster setup uses different environmental variable settings than
-what we added to our environment above. So, we need to clear them out. The
-easiest way is to log out of the VM and log right back in (the variables that
-we need to remain set will be preserved as we added them to the
-`~/.bash_profile` file).
-
-Exit vagrant by typing:
-
-```shell
-exit
-```
-
-Now we are back in the host OS, and we can spin up Vagrant again.
-
-```shell
-vagrant ssh
-```
-
-The code for the GPDB demo cluster is in `/gpdb/gpAux/gpdemo`. The scripts
-there will attempt to build a data directory in the `gpdemo` directory. As you
-may recall `/gpdb` is a shared directory between the host OS and the guest OS.
-This shared directory does not behave like a regular file system in the guest OS
-in many cases. So, we need to run the demo scripts from a location that is not
-in the shared filesystem. (For those who are curious about more details, the
-`initdb` initialization will, in many cases, fail if the data directory is
-under `/gpdb`).
-
-We already have a GPDB data directory, so let us go there and copy the demo
-files there.
-
-```shell
-su - gpadmin
-cd /home/gpadmin/gpdb_data/
-cp -R /gpdb/gpAux/gpdemo .
-cd gpdemo
-```
-
-Then, run the `gpdemo` scripts to build a cluster.
-
-```shell
-make cluster
-source gpdemo-env.sh
-```
-
-Finally, we are ready to run queries, so we fall back to the flow above as:
-
-```
-DBNAME=$USER
-createdb $DBNAME
-psql $DBNAME
-```
-
-You are back in the GPDB shell and you can fire away SQL queries as before. Now,
-you actually have a GPDB cluster running in Vagrant. You can use `top` to
-examine the processes that are running, or use `htop` (we installed that as part
-of the original Vagrant install).
+## 4: GPDB without GPORCA
+If you want to run GPDB without the GPORCA query optimizer, run `vagrant up gpdb-without-gporca`.
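
The updated README now names the machine in every Vagrant command (`gpdb` or `gpdb-without-gporca`). As a rough illustration of that workflow, and not something this commit adds, the sketch below prints the commands for a given machine name without running them. The function name `gpdb_vm_commands` is hypothetical; the two machine names and the `up`/`ssh`/`destroy` subcommands are the ones the README text itself uses.

```shell
#!/usr/bin/env bash
# Dry-run helper (hypothetical, not part of the repository): prints the
# Vagrant commands for one of the two machine names used in this README.
# It only echoes the commands; it never invokes vagrant itself.
gpdb_vm_commands() {
  local machine="${1:-gpdb}"   # default to the GPORCA-enabled machine
  case "$machine" in
    gpdb|gpdb-without-gporca) ;;
    *)
      echo "unknown machine: $machine" >&2
      return 1
      ;;
  esac
  echo "vagrant up $machine"       # create and provision the VM
  echo "vagrant ssh $machine"      # log in to the guest
  echo "vagrant destroy $machine"  # tear down before re-creating
}
```

Calling `gpdb_vm_commands gpdb-without-gporca` lists the same three steps for the machine built without the GPORCA optimizer; any other name is rejected with a non-zero status.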