3  Fundamentals

Misc

  • Notes from the course, “A Cloud Guru: Docker Fundamentals”
  • Resources
  • To start a Docker project, run docker init
    • Intro
    • Execute the command in the target project folder and it will generate Docker files according to the project’s programming language. Currently, the only data science language supported is Python.
    • Sets up the essential scaffolding for your project.
      • A .dockerignore to keep unwanted files out,
      • A Dockerfile tailored to your project’s needs,
      • A compose.yaml for managing multi-container setups
      • A README.Docker.md for documentation.
  • Docker file linting tools
    • hadolint
    • Snyk
    • Trivy
    • Clair
    • Anchore
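    • Example: running hadolint against a dockerfile (a minimal sketch; it can also be run via its container image without installing anything)

        hadolint Dockerfile
        docker run --rm -i hadolint/hadolint < Dockerfile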
  • Docker help commands (every command preceded by docker <space> <command>)
    • Every management command has its own subset of commands associated with it
    • Management
      • container, image, network, node, plugin
      • secret, service, stack, swarm, system, volume
      • Example: docker image --help
        • Shows sub-commands for management command image and short descriptions
        • For even more information on a sub-command, add --help after the sub-command
          • Example: docker image build --help
  • Linux OS stuff
    • Debian is a very stable Linux distribution
      • Debian-Jessie was the stable Debian release at the time of the course
        • (As of July 6, 2019, the stable release is called Buster)
        • Slim is a light-weight variant of the Debian image
      • Debian-Wheezy is the release before Jessie
    • Alpine is a very small Linux distribution (much smaller than even Slim)
  • An instance of an image, i.e. the result of running an image, is called a container
    • Any changes made while running the container are lost once that container has been stopped.
      • If you create or add a file while in the container, stop the container, then rerun the container, the file will no longer be there.
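      • A quick way to see this (a sketch using the alpine image):

        docker container run -it --rm alpine sh    # inside: touch /tmp/demo, then exit
        docker container run --rm alpine ls /tmp   # /tmp/demo is gone; a fresh container starts from the unchanged image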
  • An image is the setup of the virtual computer. A combination of a file system and parameters. A package that rolls up everything you need to run an application.
    • Composed of stacked layers where the layers are self-contained files
    • You can download, build, and run images, but they cannot be changed (immutable)
  • You can have many running containers of the same image.
  • Docker Hub is a registry of docker repositories. It’s like a github for images. Each repo holds 1 image, but the repo can store many tagged versions of that image (version control)
  • Pull and run a docker image from Docker Hub from local cli
    • docker run url/repo/image
      • Example: docker run docker.io/library/hello-world
        • docker.io is docker hub
        • library is the repo name for all “official” images
        • hello-world is the name of image
  • Two ways to create an image
    • While inside a container, make changes. Then use commit command to create the new image with the changes
      • Never used. Creating a dockerfile is superior.
    • Using a dockerfile
  • Images on docker hub
    • mdancho/h2o-verse
      • h2o, tidyverse, tensorflow, keras
      • ~2 GB

Creating a Dockerfile

  • Example of a toy python flask app

Architecture

  • Start with FROM base:tag
    • Essentially copy-pastes the base image
    • Usually good to start with a base image
      • Example: FROM python:2.7-alpine
        • python is the base image and 2.7-alpine is the tag
  • ARG: Defines a variable that users can pass at build-time to the builder with the docker build command using the --build-arg <varname>=<value> flag.
    • Don’t use ARG for secrets: build arguments are visible in the docker history command and in max mode provenance attestations, which are attached to the image by default if you use the Buildx GitHub Actions and your GitHub repository is public. See the RUN --mount=type=secret docs for the secure way to use secrets.

    • ARG variables placed before the FROM instruction will not be available after the FROM instruction

    • Example: Variables with defaults

      FROM busybox
      ARG user1=someuser
      ARG buildno=1
      # ...
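      • Overriding the defaults at build time (a sketch; argdemo is an arbitrary tag):

        docker image build -t argdemo --build-arg user1=alice --build-arg buildno=7 .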
    • Example: Before and After FROM

      • Before

        # syntax=docker/dockerfile:1
        
        ARG DEBVER="10"
        ARG CAPVER="7.8"
        FROM debian:${DEBVER}
        
        RUN printf "DEB=${DEBVER}\nCAP=${CAPVER}\n" > /tmp/log
        • DEBVER=10 can be passed with --build-arg to specify the debian version you want, but it will not be written to the log file since it is declared before the FROM instruction
      • After

        # syntax=docker/dockerfile:1
        
        ARG DEBVER="10"
        FROM debian:${DEBVER}
        ARG DEBVER
        ARG CAPVER="7.8"
        
        RUN printf "DEB=${DEBVER}\nCAP=${CAPVER}\n" > /tmp/log
        • The ARG DEBVER after the FROM instruction re-declares the variable inside the build stage; since no default is given, it inherits the value from the declaration before FROM (or from --build-arg)

        • Run image and check value in log file

          docker run --rm <tag> cat /tmp/log
  • RUN executes commands or scripts as if you were in the container’s OS
    • Example: RUN mkdir /app
      • Makes a directory called “app”
  • WORKDIR sets the working directory for everything that happens after this command
    • Example: WORKDIR /app
  • COPY has source path+file (local) and a destination path+file (container) as args
    • Example: COPY requirements.txt requirements.txt
      • The first requirements.txt is in the same directory as the docker file so no “/other-dir/” required
      • The second “requirements.txt” says the file is to be placed into the /app dir
        • Equivalent to “/app/requirements.txt” because /app is our working directory
    • Cannot use .. to move above the dockerfile directory. Every path must be below it.
  • Install packages
    • Example: RUN pip install -r requirements.txt
      • requirements.txt contains Flask==0.12
  • Copy entire local directory to the working directory
    • COPY . .
      • First period says copy everything in the current local directory
      • Second period says put everything in the working dir
  • LABELs have key-value pairs and can be useful in production settings. They can be retrieved with a command later on.
    • Some uses:

      • Filter containers based on a label
      • Can include scripts for automatic load balancing (also see aws load balancer section below)
    • Example:

      LABEL maintainer="Eric Book <ericbook@email.com>" \
            version="1.0"
    • Often changed or added to, so they should be close to the end of the dockerfile (but not the last line)
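    • Example: reading labels back later (a sketch; web1 is the image built in the next section)

      docker image inspect --format '{{ index .Config.Labels "maintainer" }}' web1
      docker container ls --filter "label=version=1.0"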

  • CMD gives the default instruction when the image is run, which is different from RUN commands, which are executed when the image is built.
    • Example: CMD flask run --host=0.0.0.0 --port=5000
    • Under-the-hood: CMD instructions are arguments to an entrypoint script and get run through a default docker entrypoint (see the entrypoints section waaaaay below)
  • A \ is the Linux line-continuation operator: it escapes the newline at the end of the line so one instruction can span several lines of code for easier reading.
  • The ordering of the instructions in the dockerfile affects its size and efficiency
    • The source code is much more likely to change in the future than the package dependencies. Therefore, even though there would be fewer COPY commands the other way, it’s best to install the dependencies before copying the source code.
    • Docker caches layers, so a rebuild only re-executes from the first changed layer onward; everything above it comes from the cache. COPY . . executes in millisecs, so a source-only rebuild happens almost instantly, while installing dependencies could take minutes. (See the assembled dockerfile below.)
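    • Putting the pieces together, the toy flask dockerfile from this section in cache-friendly order (assembled from the examples above):

      FROM python:2.7-alpine

      RUN mkdir /app
      WORKDIR /app

      # Install dependencies first so source edits don't invalidate this cached layer
      COPY requirements.txt requirements.txt
      RUN pip install -r requirements.txt

      # Source code changes often, so copy it last
      COPY . .

      LABEL maintainer="Eric Book <ericbook@email.com>" \
            version="1.0"

      CMD flask run --host=0.0.0.0 --port=5000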
  • Attaching packages and libraries
    • Download and install R packages
      • Example: RUN R -e "install.packages(c('shinydashboard', 'reticulate', 'shiny'))"
    • Download, install Python libraries (ubuntu image)
      • Example:

        RUN ln -s /usr/bin/python3 /usr/bin/python && \
            ln -s /usr/bin/pip3 /usr/bin/pip
        RUN apt-get update
        RUN apt-get install -y libpython-dev
        RUN apt-get install -y libpython3-dev

Images

  • Build image from dockerfile
    • ** don’t forget the dot at the end **
    • The end of the build output shows a hash id: Successfully built <id>
    • Example: docker image build -t web1 .
      • image is the management command
      • build is the sub-command of image
      • -t is the flag for “tag”
      • web1 is the tag. This allows us to refer to this image as “web1” instead of a hash
      • . specifies the build context: use the current directory (where the dockerfile lives) as the source for the build
    • Example: docker image build -t web1:1.0 .
      • Versions the image
      • With just web1, the tag defaults to web1:latest
  • Inspect the image
    • Command: docker image inspect web1
    • Info is in json format
    • At the top is the complete hash id
    • Shows the image’s tags under RepoTags
    • Various info about how the layers were created
    • The number of layers created is shown at the bottom by the lines preceded by “sha256:” and a hash
  • List of images in local docker host
    • Command: docker image ls
      • Base images pulled while building from dockerfiles are listed alongside the images we create
      • <none> images are “dangling images”: untagged leftovers from failed builds or from rebuilding images that already existed
        • Safe to delete; frees up disk space
  • Delete local image
    • Using name and version tag
      • Command: docker image rm web1:1.0
    • Using id
      • Command: docker image rm -f 633f
        • -f is the “force” flag
          • Necessary when image has been tagged/copied (like when pushing to docker hub) and you want to remove both images.
        • 633f - Only need the first four characters of the hash id
          • IDs in docker image ls

Docker Hub

  • Login to Docker Hub
    • Only have to do it once; a config file is then created so you don’t have to do it again
    • Input your username and password for your Docker Hub acct
    • Command: docker login
  • Push image to Docker Hub
    1. Tag image with docker hub username
      • Example: docker image tag web1 ercbk/web1:latest
        • web1 is the name of the image we want to push
        • ercbk/web1:latest
          • ercbk is the username
          • web1 is the repo
          • latest is the tag
      • docker image ls will show the newly tagged image
    2. Push image to docker hub
      • Example: docker image push ercbk/web1:latest
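  • Once pushed, the image can be pulled on any machine (a quick sketch)

    docker image pull ercbk/web1:latest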

Running Containers

Misc

  • ** Unless you include -d in the run command, you will need to open a separate terminal (i.e. different from the one you ran the run command in) in order to run more docker cli commands while an app is running **

  • ** Container names and ports need to be unique for each running container **

  • Example 1:

    docker container run -it -p 5000:8000 -e FLASK_APP=app.py web1
    • web1 is the name of the image.
    • This is probably the fewest flags necessary to run an app in a container
  • Example 2:

    docker container run \
      -it --rm \
      --name web2 \
      -p 5000:8000 \
      -e FLASK_APP=app.py \
      -d --restart on-failure \
      web2

Commands

  • docker container ls
    • Lists all running containers, container_id, image, creation time, runtime, name
    • -a
      • Shows all stopped containers
  • docker container rm <name>
    • Deletes a stopped container
    • Can also use 1st four characters of container_id
      • Shown in ls (see above)
  • docker container run
    • The basic run command
    • Hardly ever want to use just the basic command
      • See flags and examples below
  • docker container stop <name>
    • Stops a container
    • Add more names to stop more than one container
      • With spaces between the names
    • Also, ctrl + c
      • Use it in the same terminal you used the run command in.
      • Only works if you included the -it flag when you started it.
    • Also see --rm below
    • docker container stop $(docker container ls -q)
      • Stops all running containers
      • -q flag says list them quietly, i.e. print only the container ids
      • If you get a “stop requires at least one argument” error, then there aren’t any containers running
  • docker container logs <name>
    • Can also use 1st four characters of container_id
    • For active or inactive containers
    • Shows times container was accessed
    • -f
      • Follows the log in the foreground (i.e. you can view accesses in real-time)
      • ctrl + c to kill it
  • docker container stats
    • Real-time monitoring of active containers
    • Shows cpu and memory usage for each container
    • Network input/output
  • exec
    • Executes an interactive bash session in the container
    • Container must be running, so open a new terminal window and type the code line:
      • docker container exec -it web1 bash
        • Where web1 is the name of the container
        • Bash is for the Slim distribution of Linux; use sh for Alpine
      • Or run a bash command detached in the background: docker exec -t -d web1 bash -c "ls -al"
    • You’ll be logged in under “root” and dropped into wherever you designated the working directory in the dockerfile
    • Type ls -la to view the files
    • Example: Debugging
      1. Python app isn’t showing changes after running it with a volume flag (-v)
      2. You exec a bash session inside the container
      3. Delete some .pyc files (might be corrupted somehow) that are created when python runs flask
        • rm *.pyc
        • ls -la to confirm
      4. Go to local script and make changes, save
      5. Go to terminal window where container is running and see changes to scripts detected in the terminal
      6. Go to browser where app is running and refresh
      7. Wait a few secs and changes show up. Yay.
      8. Go back to bash terminal window, ctrl + d to kill it

Flags

  • ** Don’t think flag order matters, except that the name of the image needs to be last **
  • -d
    • Runs the container in the background
    • Allows you to run commands in the same terminal window that you used the container run command in
  • -e
    • Allows you to pass an environment variable into the container
    • Not always necessary
      • Example: Flask requires one
        • -e FLASK_APP=app.py
        • app.py is the name of the app script
    • Multiple -e flags are allowed
      • Example: -e FLASK_DEBUG=1
        • Turns debug mode on when you run the container
        • Use along with -v to make real-time changes to the app while the container is running
  • -it
    • Allows for unix commands such as ctrl + c to kill a process (such as a running container)
    • Makes the docker container interactive, which allows you to go into the container, navigate the file system, and make changes.
  • --name
    • Docker automatically provides a name for a container, but this flag allows you to provide your own
    • Example: --name moose
  • -p
    • Ports
      • Maps ports from local machine to ports within docker
    • Expected to supply two ports separated by a colon
    • Example: -p 5000:5000
      • The first 5000 is the host port: the port you use to interact with the app in your browser (e.g. localhost:5000)
        • All containers running on your docker host need to have unique host ports (e.g. -p 5001:8000)
        • So, I think this can be any port you want as long as something else isn’t already using it.
      • The second 5000 is the container port that was specified in the dockerfile
        • Tried 5000:5000 and the app didn’t run in the browser even when the dockerfile had 5000 specified, but after specifying 8000 in the dockerfile, -p 5000:8000 did work
          • This was run on the default docker network
        • Running -p 5000:5000 DID work on a custom network.
          • Also these are 2 different apps. The 1st one was stand-alone (03-lecture, cloud guru docker fundamentals), and the 2nd ran in conjunction with a redis server (see the custom network section below) (09-lecture). Not sure if that makes a difference
    • When running the container, if it’s an app, it can be viewed in a browser at localhost:5000 (i.e. the first port specified)
    • Example: -p 5000
      • Docker publishes container port 5000 to a random available host port (see the port-checking sketch after this list)
    • You can specify more than one port mapping e.g. -p 5000:5000 -p 3838:3838
    • Ports (second port specified) for common images
      • redis (storage) - 6379
      • nginx (web server) - 80
        • Open source, handles a lot of connections efficiently, used to host a lot of websites
      • RStudio: -p 8787:8787
      • Shiny apps (e.g. launched via shiny-server): -p 3838:3838
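    • To check which host ports a running container’s ports are actually mapped to (a sketch; web1 is the container name):

      docker container port web1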
  • --restart
    • With value, on-failure
      • Says restart the container if there’s some catastrophic failure (e.g. docker daemon dies and restarts)
      • Useful in production
    • Cannot be included if --rm is also used
  • --rm
    • Deletes the container once it has been stopped
    • Cannot be included if --restart is also used
  • -v
    • Volumes
    • Use cases
      • While developing: make changes inside the container in real-time.
      • Store data in a db
      • Mapping to a folder on your local machine allows you to work on projects stored within the container
    • 1st Half
      • Requires the path (local machine) to the directory containing the script you’re making changes to (e.g. the app script, not the dockerfile), followed by a <colon>
      • For Linux, you can use the shortcut $PWD, which is a linux environment variable that stores the path to the working directory. It has the same value as the command pwd -L, which prints the working directory.
        • FYI, the -L relates to symlinks, which are links that point to another directory. If you’re in a symlink directory, -L will print that directory and -P will print the directory that the symlink points to.
        • Same thing for Mac, except the quotes might need to be included
      • For windows, /c/users/<user_name>/path/to/directory
    • 2nd Half
      • The address inside the container where this directory should be “mounted” (i.e. where the script you’re making changes to is located inside the container)
      • /app which is where the app script is (also the working directory specified in the dockerfile for the example)
    • Examples:
      • Linux

        docker container run \
          -it \
          -p 5000:5000 \
          -e FLASK_APP=app.py \
          --rm \
          --name web1 \
          -e FLASK_DEBUG=1 \
          -v $PWD:/app \
          web1
      • Windows

        docker container run \
          --rm -itd \
          -p 5000:5000 \
          -e FLASK_APP=app.py \
          -e FLASK_DEBUG=1 \
          --name web2 \
          -v /C/Users/tbats/Documents/R/Projects/Docker/Courses/cloud-guru-docker-fund/09-linking-containers-with-docker-networks:/app \
          web2
    • While running, if you open a new terminal and docker container inspect <name>, then there should be a mount section with type = bind
    • Troubleshooting if changes don’t show up in the container in real-time
      • It’s probably the path specifications for the value of the -v flag. Docker is picky, esp w/ Windows.
      • Make sure code script and docker are on same drive
      • Next try replacing Alpine Linux with Slim Linux in the dockerfile \(\rightarrow\) rebuild image \(\rightarrow\) run container with volume flag + the addresses like stated above
        • Something about inotify in Alpine not doing something
      • See exec command above

Networking Containers

  • Internal Networks (LANs), External Networks (WAN)
    • WAN is a wide area network. Can be public or private. Stretches across a city, an industrial park, etc.

Commands

  • Access addresses
    • Servers bound to 0.0.0.0:<port number> give access to any computers on your LAN or WAN
    • If localhost:<port number>, then only the laptop running the server can connect to it
    • If <laptop address that’s running the server>:<port number>, then any computer on your LAN, WAN, or on the internet can connect
      • e.g. 192.168.1.4:5000
  • List of Docker networks
    • docker network ls
    • ipconfig (windows) ifconfig (Linux, Mac)
      • Shows info about networks
        • The bridge network is docker0, which is the docker-supplied network
      • FYI docker0 didn’t show up on Windows for me
  • Ping from one container to another
    • Note: ping and ifconfig are removed from the Alpine and Slim images
      • To reinstall, add this to 2nd line of dockerfile
        • Alpine
          • RUN apk update && apk add iputils
        • Slim
          • RUN apt-get update && apt-get install -y net-tools iputils-ping
    • docker exec web2 ping 172.17.0.2
      • Where 172.17.0.2 is the other container’s eth0 inet address
        • Found by docker exec <container name> ifconfig
    • ctrl + c to stop the pinging
  • View etc file
    • docker exec <container name> cat /etc/hosts
    • Shows eth0 inet address is mapped to container id

Create a Custom Network

  • Allows us to connect containers by name which means if the addresses change, the apps won’t break.
  • docker network create --driver bridge <name>
    • The bridge driver is used for networking containers on the same docker host
    • For networking across multiple docker hosts, the overlay driver is used. (would need to research this further)
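    • Example: creating the firstnetwork used in the run examples below

      docker network create --driver bridge firstnetwork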
  • Inspect network
    • docker network inspect <name>
  • Add container to custom network
    • Add --net <network name> to the container run instruction
      • Example:

        docker container run \
          -it --rm --name web2 \
          -p 5000:5000 \
          -e FLASK_APP=app.py \
          -d --net firstnetwork \
          web2
      • Example:

        docker container run \
          -it --rm --name redis \
          -p 6379:6379 \
          -d --net firstnetwork \
          redis:3.2-alpine
        • This container only linked up with the app running on the browser when they were run on the custom network and not the default docker bridge network
      • Example: In debug mode

        docker container run \
          -it --name web2 \
          -p 5000:5000 \
          -e FLASK_APP=app.py \
          -e FLASK_DEBUG=1 \
          -v /C/Users/tbats/Documents/R/Projects/Docker/Courses/cloud-guru-docker-fund/09-linking-containers-with-docker-networks:/app \
          -d --rm --net firstnetwork \
          web2
    • Containers will show up when you inspect firstnetwork
    • Can now ping using container names (assuming both containers have been added to the network):
      • e.g. docker exec web2 ping redis
        • web2 is the container doing the pinging
        • redis is the container being pinged

Data Volumes

  • Allows data to persist on docker host after the container is stopped
    • Be wary of saving app state on the host, because apps should stay portable
    • For databases, persisting on the host is not so bad
  • A volume is nothing more than a folder on your computer that is linked to a folder inside the Docker container.
  • Default volume path on host: /var/lib/docker/volumes/

Commands

  • docker volume create web2_redis
    • web2_redis is the name of the volume
      • Good idea to pick a name that’s relevant to its job
  • docker volume ls
    • List of volumes
  • docker volume inspect web2_redis
    • Shows info about volume
    • The Mountpoint field shows where the volume will be stored on the host machine
  • docker volume rm <volume1 name> <volume2 name>
    • Remove specific volumes by name
  • docker volume prune
    • Removes all volumes not used by at least one container
  • docker rm -v <container name>
    • Removes the container and its anonymous volume
      • Will not remove a named volume
    • -v is required, else a “dangling” volume is left behind
  • docker volume ls -qf dangling=true
    • Lists dangling volumes
    • docker volume rm $(docker volume ls -qf dangling=true) removes them
  • Add volume flag, data volume name, and destination to container
    • Example:

      docker container run \
        -it --rm --name redis \
        -p 6379:6379 \
        -d --net firstnetwork \
        -v web2_redis:/data \
        redis:3.2-alpine
      • web2_redis is the name we gave to the data volume
      • /data is designated by the redis people
        • This technique works for mysql, postgres, elasticsearch, etc. You just have to figure out WHERE they decided they want you to store your data (i.e. the /data part)
        • Go to their docker hub image page \(\rightarrow\) view readme \(\rightarrow\) look for a section on persistent storage
        • e.g. I went to the postgres page, did a ctrl+f for “persistent”, and found a section describing when to use /data or /pgdata
  • Saving the data
    • Redis does it automatically every 30 sec
    • Manual save if you need to save right away: docker exec redis redis-cli SAVE

Docker Compose

  • Example: Named Volume

    version: '3.8'
    services:
      db:
        image: mysql
        restart: always
        environment:
          MYSQL_ROOT_PASSWORD: root
          MYSQL_DATABASE: test_db
        ports:
          - "3306:3306"
        volumes:
          - db_data:/var/lib/mysql
    volumes:
      db_data:
    • db_data is the name
    • /var/lib/mysql is the path inside the container
    • Advantages
      • Data persists after we restart or remove a container
      • Accessible by other containers
  • Example: Unnamed (aka Anonymous) Volume

    version: '3.8'
    services:
      db:
        image: mysql
        restart: always
        environment:
          MYSQL_ROOT_PASSWORD: root
          MYSQL_DATABASE: test_db
        ports:
          - "3306:3306"
        volumes:
          - /var/lib/mysql
    • No volume name here but still has the path inside the container
    • Data will persist on restart of the container, but not after the container is stopped and removed
    • Not accessible by other containers
    • Actually… even without the volumes instruction, the mysql image/dockerfile has a VOLUME instruction, so an anonymous volume would still be created.
  • Example: Bind Mounts

    version: '3.8'
    services:
      db:
        image: mysql
        restart: always
        environment:
          MYSQL_ROOT_PASSWORD: root
          MYSQL_DATABASE: test_db
        ports:
          - "3306:3306"
        volumes:
          - $PWD/data:/var/lib/mysql
    • Instead of using the default host directory for the volume, you can specify a location yourself
    • First half (before colon): Where on the host machine to mount (i.e. create) the volume
      • In the example, it’s located in the working directory + /data
      • For Linux, you can use the shortcut $PWD, which is a linux environment variable that stores the path to the working directory. It has the same value as the command pwd -L, which Prints the Working Directory.
        • FYI, the -L relates to symlinks, which are links that point to another directory. If you’re in a symlink directory, -L will print that directory and -P will print the directory that the symlink points to.
        • Same thing for Mac, except the quotes might need to be included
      • For windows, /c/users/<user_name>/path/to/directory
    • Second half (after colon): Specify the path inside the container you want to use for the volume. Usually specified by the db software.
      • In the example, it’s /var/lib/mysql which has been specified by the mysql image

Sharing Data Between Containers

  • Containers sharing and receiving need to be on the same docker host
  • Add to app dockerfile
    • VOLUME ["/app/public"]
      • Line location in the dockerfile wasn’t specified but he put his right before the CMD line
      • Local directory with app has a folder named public that will be shared with other containers.
  • Steps
    1. Build app image

    2. Run app image

    3. Run redis image with the flag --volumes-from <app container name>

      docker container run \
        -it --rm --name redis \
        -p 6379:6379 \
        -d --net firstnetwork \
        -v web2_redis:/data \
        --volumes-from web2 \
        redis:3.2-alpine
  • Files in app’s public folder will be in redis’s container
    • Steps
      1. Go into redis container: docker container exec -it redis sh
      2. Change directory to the public folder: cd /app/public
      3. Contents of app’s public folder are in this folder too
        • Examine contents by printing to the terminal: cat main.css
  • While containers are running, you can exec into the volume container (e.g. web2) add files, make changes to files, etc., and the files in the other containers will be updated in real time.
  • Alternate method (*not recommended for production*)
    • Don’t put the VOLUME instruction in the dockerfile
    • Add -v /app/public to the app run command
    • Add the --volumes-from flag to the redis container run command like before

Optimizing

  • .dockerignore
    • Contains file paths to files in the local project directory that you don’t want on the image (e.g. files with private data, .git files can be huge)
    • During the image build, when docker runs the COPY/ADD instructions, it will bypass the files in the .dockerignore file
    • File paths in the .dockerignore file are relative to the root of the build context (the directory you pass to docker build), not to WORKDIR
    • Examples:
      • .git/
        • Adding a trailing / isn’t necessary but lets people know it’s a directory and not a file
      • May want to include the .dockerignore itself
      • folder/*
        • The folder itself will be added to the image but all the content will be ignored
      • **/*.txt
        • Says ignore all files with the .txt extension
      • !name.txt
        • This grants an exception to the name.txt file. It will be added to the image even with the **/*.txt
      • You can negate the .dockerignore file
        • Just a * on line 1
        • Begin each file path with a ! to specify which files to include
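      • Example: a .dockerignore combining the patterns above (a sketch)

        .git/
        .dockerignore
        folder/*
        **/*.txt
        !name.txt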
  • Remove the system build dependency files
    • In other words the files used to build the system dependency files

    • Only for Alpine Linux

    • He uses the example of a postgres dependency in the video, but no dependencies are actually listed in the dockerfile in the files folder.

      • 2 guesses on what he’s talking about:
        1. The package you use in python to interact with the sql db has dependencies and files are needed to build those dependencies (build dependencies). After the dependencies are built, the build dependencies get deleted.
        2. This image which includes the sql db and that db has build dependencies. They get deleted
      • I think it’s 1., but I dunno. In the installing packages section above, there are examples of python-dev files being installed, but I’m not sure which guess that favors.
    • It’s a bear of a bash script, so see the dockerfile in the cloud guru docker fund course files (lecture 012 folder on optimizing) for details

      • Script stays above the COPY . . instruction
      • If you use this script, thoroughly check everything and make sure the packages are working as intended.
    • Example: Postgres DB

      RUN apk add --no-cache --virtual .build-deps \
          postgresql-dev dependency2 dependency3 \
          && pip install -r requirements.txt \
          && find /usr/local \
              \( -type d -a -name test -o -name tests \) \
              -o \( -type f -a -name '*.pyc' -o -name '*.pyo' \) \
              -exec rm -rf '{}' + \
          && runDeps="$( \
              scanelf --needed --nobanner --recursive /usr/local \
                      | awk '{ gsub(/,/, "\nso:", $2); print "so:" $2 }' \
                      | sort -u \
                      | xargs -r apk info --installed \
                      | sort -u \
          )" \
          && apk add --virtual .rundeps $runDeps \
          && apk del .build-deps

Running Scripts When a Container Starts

  • Instead of making multiple similar dockerfiles/images, you can use an entrypoint script with a single dockerfile/image

    • Examples:
      • For a PostgreSQL image, you pass a RUN instruction inside the dockerfile that sets up your user authorization and password as an environment variable. It also has an ENTRYPOINT, so that if you have other projects that use a PostgreSQL db, they can gain access to that information through the entry point.
      • Running a db migration after a container starts
        • e.g. a run_db_migration environment variable with a default value of off; you can control that action through the entrypoint and a script
      • Control stuff in a nginx config to set an external ip after the container is running
  • The commands in a docker-entrypoint script run at container start, not at build time, so they don’t add layers to the image

  • Steps

    1. Create Dockerfile

      COPY docker-entrypoint.sh /
      RUN chmod +x /docker-entrypoint.sh
      ENTRYPOINT ["/docker-entrypoint.sh"]
      • Line 1: docker-entrypoint.sh should be a file in the root project directory and not in the app directory, because it’s best practice to keep entrypoint files separate.
      • Line 2: Tells Linux to give the entrypoint script permission to run
      • Line 3: The entrypoint instruction that points to where the script is located.
    2. Run redis container as before

    3. Build app image as before (gave it the name, webentrypoint)

    4. Run app container

      docker container run \
        -it --name webentrypoint \
        -p 5000:5000 \
        -e FLASK_APP=app.py \
        -e FLASK_DEBUG=1 \
        --rm --net firstnetwork \
        webentrypoint
      • Name and image values changed to webentrypoint
      • Removed -d flag because we want it to run in the foreground
    5. If you go to localhost:5000 it has some message printed from the docker-entrypoint.sh script

    6. Stop the app container

    7. Re-run the app container

      docker container run \
          -it --name webentrypoint \
          -p 5000:5000 \
          -e FLASK_APP=app.py \
          -e FLASK_DEBUG=1 \
          -e WEB2_COUNTER_MSG="Docker fans will have visited this page" \
          --rm --net firstnetwork \
          webentrypoint
      • WEB2_COUNTER_MSG is an environment variable that was given a default value inside the docker-entrypoint.sh script.
    8. So we were able to change the environment variable without having to run the container, exec into the container, then change it.

  • Details on the docker-entrypoint.sh file

    • When you run the container, this script runs before the CMD instruction in the dockerfile

    • Exports an environment variable that gets accessed by the app.py script

    • File

      #!/bin/sh
      set -e
      echo "The Dockerfile ENTRYPOINT has been executed!"
      export WEB2_COUNTER_MSG="${WEB2_COUNTER_MSG:-carbon based life forms have sensed this website}"
      exec "$@"
      • Line 1: Shebang
      • Line 2: Says abort if there’s an error in the script
      • Line 3: Line that prints in the terminal when we run the container
      • Line 4: WEB2_COUNTER_MSG environment variable is created and a default value set (a value passed with -e is kept)
      • Line 5: Says that after running everything in this script, run the CMD stuff in the dockerfile
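    • On the app side, a hypothetical sketch of how app.py could read the exported variable (the course’s actual app.py isn’t reproduced in these notes):

      import os

      # Value comes from docker-entrypoint.sh, or from a -e flag at run time
      counter_msg = os.environ.get("WEB2_COUNTER_MSG", "no message set")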

Docker Utility Functions

  • Can be run in any directory
  • docker system df
    • Shows resources being used by docker
    • Adding the -v flag at the end produces a verbose output where all the info is broken down further
  • docker system info
    • Info about your docker installation
      • Useful when reporting bugs, creating issues
      • Verify docker installation
  • docker system prune
    • Deletes all the crud
      • Stopped containers that you forgot to include --rm on
      • Volumes not used by at least one container
      • Networks not used by at least one container
      • All dangling images
    • Add -f flag to execute the prune without the confirmation message
      • Useful for automating through cron jobs, etc.
    • Add -a flag to remove ALL unused images
      • ** Only run if you want every image not being used by a running container to be deleted **

3.1 Docker Compose

  • Note the dash between docker and compose in commands

  • docker-compose.yml properties

    • yaml files do not need to be in the same directory as your dockerfiles. You just have to give the path in the build property (see below)
  • Example:

    version: '3'
    
    
    # pound sign is for comments
    services:
      redis:
        image: 'redis:3.2-alpine'
        ports:
          - '6379:6379'
        volumes:
          - 'redis:/data'
    
      web:
        build: '.'
        depends_on:
          - 'redis'
        env_file:
          - '.env'
        ports:
          - '5000:5000'
        volumes:
          - '.:/app'
    
      worker:
        build: '.'
        command: celery <command>
        depends_on:
          - 'redis'
        env_file:
          - '.env'
        volumes:
          - '.:/app'
    
    volumes:
      redis: {}
  • version is the version of the compose api

  • services are the containers we’re building

    • Service names (e.g. redis, web) will end up being the container and image names (see below for the exception)
    • image property
      • uses the base:tag format like the value for the FROM instruction in the dockerfile
    • build property
      • ‘.’ says build an image from the current directory
      • If your app has its own folder, then you need to specify the path
        • Example: ‘./web’
    • image and build properties in the same service
      • docker builds the image from the build path and names it with the image value
      • Example: build: ‘.’ and image: ‘ercbk/web:1.0’
        • Says build the image using the dockerfile in the current directory and name it ercbk/web:1.0
        • Useful if the image is going to be pushed to the docker hub
    • ports
      • Ports to be used for the container (see -p flag in running containers section for more details)
        • The container port (right side) supplied here needs to match the port in the dockerfile
      • A <dash> indicates a list in yaml.
        • Example: forward 2 sets of ports

          - '6379:6379'
          - '5348:5348'
  • networks

    • Takes list inputs
    • Used when multiple networks are created
      • Example: databases/workers communicate on one network while apps communicate on both networks

        result:
          build: ./result
          command: nodemon --debug server.js
          volumes:
            - ./result:/app
          ports:
            - "5001:80"
          networks:
            - front-tier
            - back-tier
        worker:
          build: ./worker
          networks:
            - back-tier
        db:
          image: postgres:9.4
          volumes:
            - "db-data:/var/lib/postgresql/data"
          networks:
            - back-tier
  • volumes

    • See data volumes section for more details
    • <dash> means, just like for ports, values for this property are in list format
    • In the web (app) service,
      • Specify which directory to share
        • ‘.:/app’ says share the current local directory, which will be the /app directory in the container
    • In the redis service,
      • redis is the name given to the data volume (declared under the top-level volumes property)
      • /data is designated by the redis people
  • depends_on

    • Necessary if one service depends on another; indicates that one container needs to start before another.
      • Example: web (app) depends on redis (db)
    • takes a value in list format
  • environment

    • Method 1 for setting environment variables
    • name: value pair
      • Example:

        environment:
            FLASK_DEBUG: 'true'
  • env_file

    • Method 2 takes a list of environment files to load

    • Loads from top to bottom

      • If any variables in a lower-listed file match those in a file listed higher up, the values from the earlier file get overwritten by the values in the later file (see the stacking sketch at the end of this section)
      • Useful if you have a stock env file: when you put the containers into production, you can just add a production env file to the directory and the yaml
    • Example: .env is the name of the file in the current directory (<dot> is in the actual name)

    • Example: File

      COMPOSE_PROJECT_NAME=web2
      PYTHONUNBUFFERED=true
      FLASK_APP=app.py
      FLASK_DEBUG=1
  • The flask variables are the ones we passed in the run containers section

  • Can configure Docker Compose options here

    • Example: supply a project name. Otherwise Docker would just use the current directory for the name of the project. Also gets added as a prefix for networks, etc. that Compose will create.
  • The PYTHONUNBUFFERED variable is necessary if you want to see the output from the terminal through Compose
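  • Example: stacking env files (a sketch; .env.prod is a hypothetical file name)

      env_file:
        - '.env'        # base settings
        - '.env.prod'   # listed later, so its values win on conflicts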

  • worker (service)

    • Use the same dockerfile for both services
      • eg Worker uses the same dockerfile as web in the example docker-compose.yml
    • Useful for background services for your app
      • celery is a python library used for this kind of task
    • Differences
      • No need for a port to be exposed
      • Uses the command property, which overrides the CMD instruction in the dockerfile
  • volumes

    • The name given here needs to match the name of the volume given in the services property
      • Example: for this yaml the name of the volume created is “redis”
    • {} - curly brackets are for adding options to the volume, such as being read-only (see docker docs for more options)
      • In this example no options are supplied so it’s empty (still necessary though)

3.2 Managing a Web App With Docker-Compose

  • Note the dash between docker and compose in commands
  • The current directory needs to have the docker-compose.yml file
    • Unless you use the -f flag
  • docker-compose --help
    • Info on commands
  • docker-compose build
    • Builds an image of any service in the yaml file with a build property
    • docker image ls will show the built images with the project name as the prefix
      • Project name set in the env file (see above)
  • docker-compose pull
    • Pulls any other images specified with the image property in the yaml file
      • Not just local images; I think it also pulls from docker hub
  • docker-compose up
    • Runs the project (everything created with project name prefix)
      • Creates network
        • eg web2_default
      • Creates volume
        • eg web2_redis
      • Starts containers
        • eg web2_redis_1, web2_web_1
          • The “_1” suffix is in case the project calls for multiple instances of the same container (see the scaling sketch after this block)
            • Reminder: multiple instances require different host ports (see -p in the running-a-container sections). Would also require setting up a load balancer. (see aws load balancer section below)
    • docker-compose up <service>
      • Starts a specific service
      • If you start a service with a dependency (“depends_on” specified in yaml), it will also start that dependency service.
      • docker-compose up web
        • Starts web but also redis, because web has a specified redis dependency
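    • Example: starting two instances of a service (a hedged aside; flag availability depends on your Compose version, and the fixed 5000:5000 host port mapping would have to be freed up first)

      docker-compose up -d --scale web=2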
  • docker-compose stop
    • Stops all containers
      • Can also specify a service
        • eg docker-compose stop web
    • Can also use ctrl+c, but (as of 2017) there’s a bug that throws an error and aborts instead
  • docker-compose up --build -d
    • Runs both build and up commands at once
    • -d says run in the background
  • docker-compose ps
    • Similar info as docker container ls but presented slightly differently
  • docker-compose logs -f
    • Since the containers are running in the background, -f is needed to follow the logs of container activity
    • It’s a realtime log of all containers running in the project, so will require ctrl+c to exit
  • docker-compose restart
    • Restarts all containers
    • Can also just specify one container
      • eg docker-compose restart redis
  • docker-compose exec web
    • For a running container, you can execute commands inside the container from the outside
      • docker-compose exec web ls -la
        • Shows file contents inside container
    • Opening a shell inside the container
      • docker-compose exec web sh
        • No -it flag necessary, unlike for docker container exec (see running-a-container section)
    • exit
      • Exits the shell
  • docker-compose run <service> <command>
    • Allows you to instantly run the container, execute a command inside the container, and exit the shell
    • Equivalent to the command sequence: up \(\rightarrow\) exec \(\rightarrow\) exit
    • Example: docker-compose run redis redis-server --version
  • docker-compose rm
    • Deletes all stopped service containers
    • These will also be removed using the docker system prune from the docker-utility-functions section