Referring to this example, we create a canonical dev environment for Go and Python developers using Docker images.
When we use this Docker image for daily development work, the source code relies
on the host computer instead of the container. The source code includes this repo
and all its dependencies, for example, the Go package google.golang.org/grpc
.
Code-on-the-host allows us to run our favorite editors (Emacs, VIM, Eclipse, and more)
on the host. Please free to rely on editors add-ons to analyze the source code
for auto-completion.
We build a Docker image that contains development tools below.
- Python Interpreter
- Go compiler
- Protobuf compiler
- Protobuf to Go compiler extension
- Protobuf to Python compiler extension
Because this repo contains Go code, please make sure that you have the directory structure required by Go. On my computer, I have GOPATH set to $Home/go, you can have your $GOPATH
pointing to any directory as you like.
export GOPATH=$HOME/go
Now that $GOPATH$
is set, we could git clone the source code of our project by running:
go get github.com/sql-machine-learning/sqlflow
Change the directory to our project root, and we can use go get
to retrieve
and update Go dependencies. Note -t
instructs get to also download the packages required to build
the tests for the specified packages. As all Git users would do, we run git pull
from time to time to sync up with
others' work. If somebody added new dependencies, we might need to run go -u ./...
after git pull
to update dependencies.
cd $GOPATH/src/github.com/sql-machine-learning/sqlflow
go get -u -t ./...
To build the project, we need protobuf compiler, Go compiler, Python interpreter and gRPC extension to protobuf compiler. To prepare our dev environment with these tools, the easist way is to pull latest image from DockerHub by running command below and give it an alias sqlflow:latest. Alternatively, we provide a Dockerfile where can build image from. Note it will take a while to build from Dockerfile, especially when the network is unpredictable.
docker pull sqlflow/sqlflow:latest
docker tag sqlflow/sqlflow:latest sqlflow:latest
or
docker build -t sqlflow:latest .
We build and test the project inside the docker container. To run the container, we need to map the $GOPATH
directory on the host into the
/go
directory in the container, because the Dockerfile configures /go
as
the $GOPATH
in the container:
docker run --rm -it -v $GOPATH:/go \
-w /go/src/github.com/sql-machine-learning/sqlflow \
sqlflow:latest bash
Inside the Docker container, start a MySQL server in the background
service mysql start
run all the tests as
go generate ./...
SQLFLOW_TEST_DB=mysql go test -v ./...
where go generate
invokes the protoc
command to translate server/sqlflow.proto
into server/sqlflow.pb.go
and go test -v
builds and run unit tests. The environment variable
SQLFLOW_TEST_DB=mysql
specify MySQL as the backend, you can also check test_hive.sh and
test_maxcompute.sh to run the unit tests with other backends.
The demo requires a MySQL server instance with populated data. If you don't, please follow example/datasets/README.md to start one on the host. After setting up MySQL, run the following inside the Docker container
go run cmd/demo/demo.go --datasource="mysql://root:root@tcp(host.docker.internal:3306)/?maxAllowedPacket=0"
You should be able to see the following prompt
sqlflow>