-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathflow.go
344 lines (231 loc) · 12.6 KB
/
flow.go
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
/*
<!--
Copyright (c) 2017 Christoph Berger. Some rights reserved.
Use of the text in this file is governed by a Creative Commons Attribution Non-Commercial
Share-Alike License that can be found in the LICENSE.txt file.
Use of the code in this file is governed by a BSD 3-clause license that can be found
in the LICENSE.txt file.
The source code contained in this file may import third-party source code
whose licenses are provided in the respective license files.
-->
<!--
NOTE: The comments in this file are NOT godoc compliant. This is not an oversight.
Comments and code in this file are used for describing and explaining a particular topic to the reader. While this file is a syntactically valid Go source file, its main purpose is to get converted into a blog article. The comments were created for learning and not for code documentation.
-->
+++
title = "Get into the flow"
description = "Flow-based programming in Go"
author = "Christoph Berger"
email = "[email protected]"
date = "2017-02-25"
publishdate = "2017-02-25"
draft = "false"
categories = ["Concurrent Programming"]
tags = ["fbp", "flow-based programming", "dataflow"]
articletypes = ["Tutorial"]
+++
In Flow-Based Programming, programs are modeled as data flowing between independent processing units. Who would not think of channels and goroutines as a natural analogy?
<!--more-->
As trivial as this may sound, all software is about processing data. Yet, when you look at code written in a "traditional" programming language, the actual data flow is not readily visible. Instead, what you mainly see are just the control structures. The actual data flow only happens to occur at runtime, as a consequence from the control structures.
*Flow-Based Programming (FBP)* turns the view on code and data upside down. Here, the data flow is the first thing you look at; it is the main principle that defines the structure of your application. Processing of data happens within many small nodes that sit between the endpoints of data pipelines.
HYPE[FBP](FBP.html)
At this level, the processing nodes are just black boxes in a graphic flow diagram. The actual code hides within these boxes.
## Flow-based programming and concurrency
Looking at an FBP diagram immediately raises two thoughts.
First, the data flow model is inherently concurrent. Data streams are independent of each other, and so are the nodes. Looks like optimal separation of concerns.
Second, a data flow looks darned close to channels and goroutines!
Do we have a natural match here? It seems tempting to build an FBP model directly on Go's built-in concurrency concepts.
In fact, this has been done already.
## Go FBP libraries
A quick search on GitHub reveals a handful of Go-based FBP projects, which I list here together with their own descriptions.
### [trustmaster/goflow](https://github.com/trustmaster/goflow)
"This is quite a minimalistic implementation of Flow-based programming and several other concurrent models in Go programming language that aims at designing applications as graphs of components which react to data that flows through the graph."
### [scipipe/scipipe](https://github.com/scipipe/scipipe)
"SciPipe is an experimental library for writing scientific Workflows in vanilla Go(lang). The architecture of SciPipe is based on an flow-based programming like pattern in pure Go (...)
### [flowbase/flowbase](https://github.com/flowbase/flowbase)
"A Flow-based Programming (FBP) micro-framework for Go (Golang)."
### [ryanpeach/goflow](https://github.com/ryanpeach/goflow)
"A LabVIEW and TensorFlow Inspired Graph-Based Programming Environment for AI handled within the Go Programming Language."
### [7ing/go-flow](https://github.com/7ing/go-flow)
"A cancellable concurrent pattern for Go programming language"
### [cascades-fbp/cascades](https://github.com/cascades-fbp/cascades)
"Language-Agnostic Programming Framework for Data-Driven Applications"
### [themalkolm/go-fbp](https://github.com/themalkolm/go-fbp)
"Go implementation of Flow-based programming."
(The last one actually relies on input from a graphical FBP editor ([DrawFBP](https://github.com/jpaulm/drawfbp)) that it turns into code stubs.)
To be fair, some of these libs seem not actively maintained anymore. I included them anyway as there is no single true approach to this, and each of these libs shows a different approach and focuses on different aspects.
I also most certainly left out a few FBP libs that I failed to find in the short time of researching this topic, so feel free to do some more research on your own.
## A simple FBP flow
For today's code, I picked the first of the libraries above, `trustmaster/goflow`. It provides a quite readable syntax and comes with detailed documentation. (On the flipside, `goflow` uses quite some reflection inside, which some of you might frown upon.)
Our sample code is an incarnation of the schematic FBP graph in the initial animation. Let's turn the abstract nodes and data items into someting more tangible. For example, we could feed the network with sentences and let one node count the words in each sentence and the other all letters. The final node then prints the results.
HYPE[Counter](Counter.html)
## The code
First, we define the nodes. Each node is a struct with an embedded `flow.Component` and input and output channels (at least one of each kind, except for a sink node that only has input channels).
Nodes can act on input by functions that are named after the input channels. For example, if an input channel is named "Strings", the function that triggers on new input is called "OnStrings" by convention.
We define these nodes:
* A splitter that takes the input and copies it to two outputs.
* A word counter that counts the words (i.e., non-whitespace content) of a sentence.
* A letter counter that counts the letters (a-z and A-Z) of a sentence.
* A printer that prints its input.
None of these nodes knows about any of the other nodes, and does not need to.
*/
//
package main
import (
"fmt"
"regexp"
"strings"
flow "github.com/trustmaster/goflow"
)
// Our two `counter` nodes (see below) send their results asynchronously to the `printer` node. To distinguish between the outputs of the two counters, we attach a tag to each count. (Yes, sending just a string including the count would be easier but also more boring. The `splitter` already sends strings, so let's try something different here.)
type count struct {
tag string
count int
}
// The `splitter` receives strings and copies each one to its two output ports.
type splitter struct {
flow.Component
In <-chan string
Out1, Out2 chan<- string
}
// `OnIn` dispatches the input string to the two output ports.
func (t *splitter) OnIn(s string) {
t.Out1 <- s
t.Out2 <- s
}
// `WordCounter` is a `goflow` component that counts the words in a string.
type wordCounter struct {
// Embed flow functionality.
flow.Component
// The input port receives strings that (should) contain words.
Sentence <-chan string
// The output port sends the word count as integers.
Count chan<- *count
}
// `OnSentence` triggers on new input from the `Sentence` port.
// It counts the number of words in the sentence.
func (wc *wordCounter) OnSentence(sentence string) {
wc.Count <- &count{"Words", len(strings.Split(sentence, " "))}
}
// `letterCounter` is a `goflow` component that counts the letters in a string.
type letterCounter struct {
flow.Component
Sentence <-chan string
// The output port sends the letter count as integers.
Count chan<- *count
// To identify letters, we use a simple regular expression.
re *regexp.Regexp
}
// `OnSentence` triggers on new input from the `Sentence` port.
// It counts the number of words in the sentence.
func (lc *letterCounter) OnSentence(sentence string) {
lc.Count <- &count{"Letters", len(lc.re.FindAllString(sentence, -1))}
}
// An `Init` method allows to initialize a component. Here we use it to run
// the expensive `MustCompile` method once, rather than every time `OnSentence` is called.
func (lc *letterCounter) Init() {
lc.re = regexp.MustCompile("[a-zA-Z]")
}
// A `printer` is a "sink" with no output channel. It prints the input
// to the console.
type printer struct {
flow.Component
Line <-chan *count // inport
}
// `OnLine` prints a count.
func (p *printer) OnLine(c *count) {
fmt.Println(c.tag+":", c.count)
}
// `CounterNet` represents the complete network of nodes and data pipelines.
type counterNet struct {
flow.Graph
}
/*
### Assembling the network
With the nodes in place, we can go foward and create the complete network, adding and connecting all the nodes.
*/
// Construct the network graph.
func NewCounterNet() *counterNet {
n := &counterNet{}
// Initialize the net.
n.InitGraphState()
// Add nodes to the net. (I derived from the documentation by using `&{}`
// instead of `new`.) Each node gets a name assigned that is used later
// when connecting the nodes.
n.Add(&splitter{}, "splitter")
n.Add(&wordCounter{}, "wordCounter")
n.Add(&letterCounter{}, "letterCounter")
n.Add(&printer{}, "printer")
// Connect the nodes. The parameters are: Sending node, sending port,
// receiving node, and receiving port.
n.Connect("splitter", "Out1", "wordCounter", "Sentence")
n.Connect("splitter", "Out2", "letterCounter", "Sentence")
n.Connect("wordCounter", "Count", "printer", "Line")
n.Connect("letterCounter", "Count", "printer", "Line")
// Our net has 1 input port mapped to `splitter.In`.
n.MapInPort("In", "splitter", "In")
return n
}
/*
### Launching the network
Finally, we only need to activate the network, create an input port, and start feeding it with selected bits of wisdom.
*/
//
func main() {
// Create the network.
net := NewCounterNet()
// We create a channel as the input port of the network.
in := make(chan string)
net.SetInPort("In", in)
// Start the net.
flow.RunNet(net)
// Now we can send some text and see what happens. This is as easy as sending
// text to the input channel. (All aphorisms by Oscar Wilde.)
in <- "I never put off till tomorrow what I can do the day after."
in <- "Fashion is a form of ugliness so intolerable that we have to alter it every six months."
in <- "Life is too important to be taken seriously."
// Closing the input channel shuts the network down.
close(in)
// Wait until the network has shut down.
<-net.Wait()
}
/*
## How to get and run the code
Step 1: `go get` the code. Note the `-d` flag that prevents auto-installing the binary into `$GOPATH/bin`.
go get -d github.com/appliedgo/flow
Step 2: `cd` to the source code directory.
cd $GOPATH/src/github.com/appliedgo/flow
Step 3. Run the binary.
go run flow.go
The output should look like:
```
Letters: 45
Words: 17
Words: 13
Letters: 36
Words: 8
Letters: 70
```
The unordered output shows that the nodes are indeed running asynchronously. Homework assignment: Add more info to the `count` struct to allow the `printer` node grouping the output by input sentence.
## Conclusions
Still, although we were able to nicely describe our nodes and the connections between them, the resulting code is far from representing an intuitive view on the flow of data within the program. This should not be surprising. A textual representation rarely matches up with the intuitiveness of a graphic representation.
So where is the visual flow diagram editor, you ask?
There are indeed some options.
### Shenzhen Go
Just recently, an experimental visual Go environment has been presented to the public - [Shenzhen Go](https://google.github.io/shenzhen-go/). (Careful though - "experimental" means exactly this.)
![Shenzhen Word Counter](shenzen_word_counter.png)
Some nodes contain configurable standard actions, others contain Go code that reads from input channels and writes to output channels (unless the node is a sink).
![Shenzhen Print summary node](shenzen_print_summary.png)
### go-fbp and DrawFBP
If you want a graphic editor now and don't want to wait until Shenzhen Go is production ready, have a look at [themalkolm/go-fbp](https://github.com/themalkolm/go-fbp). This project generates Go code from the output of a graphical FBP editor called DrawFBP (a Java app). (Disclaimer: I have tested neither `go-fbp` nor DrawFBP.)
## Further Links
[Wikipedia: Flow-based programming](https://en.wikipedia.org/wiki/Flow-based_programming)
[John Paul Morrison: FBP](http://www.jpaulmorrison.com/fbp/)
[flowbased.org](http://flowbased.org/)
[Gopheracademy: Patterns for composable concurrent pipelines in Go](https://blog.gopheracademy.com/composable-pipelines-pattern/)
[Gopheracademy: Composable Pipelines Improved](https://blog.gopheracademy.com/advent-2015/composable-pipelines-improvements/)
**Happy coding!**
___
Updates
2022-01-08 pinned goflow to "v0" (commit 47a1b442f390) as "v1" has breaking changes
*/