Faunus: Graph Analytics Engine
http://faunus.thinkaurelius.com
[[https://raw.github.com/thinkaurelius/faunus/master/doc/images/graph-globe.png|width=100px]]
```xml
com.thinkaurelius.faunus
faunus
0.4.4
```
Incremented Titan version
```xml
com.thinkaurelius.faunus
faunus
0.4.3
```
- Added error handling for invalid RDF parameters
- Added support for incremental edge loading with
BlueprintsGraphOutputMapReduce
- Bumped to support Titan 0.4.3
```xml
com.thinkaurelius.faunus
faunus
0.4.2
```
- Added Titan ElasticSearch dependency so its available in the Hadoop job jar
- Bumped to support Titan 0.4.2
```xml
com.thinkaurelius.faunus
faunus
0.4.1
```
- Fixed a severe bug in
Configuration
entry orderings andMapSequence
- Changed default Cassandra partitioner from Random to Murmur3
- Broke
BlueprintsGraphOutputMapReduce
into a MapReduce and then a Map (speeds up edge loading)
```xml
com.thinkaurelius.faunus
faunus
0.4.0
```
- Removed support for Java 1.6 (Java 1.7+ supported)
- Bumped to support Hadoop 1.2.1
- Bumped to support Titan 0.4.0
- Bumped to support TinkerPop 2.4.0
- Renamed
FaunusRexsterExtension
toFaunusRexsterInputFormatExtension
- Added
FaunusRexsterExecutorExtension
which allows remote execution of a Faunus script and tracking of its progress - Global GremlinFaunus variables are now available in
ScriptEngine
use cases - Reduced the width of the logger output of Hadoop via a new
log4j.properties
file - Simplified
ResultHookClosure
with new Gremlin 2.4.0 classes - Fixed
has(key,null)
NullPointerException
issue - Added support for
has(key)
andhasNot(key)
- Migrated from
Query.Compare
toCompare
with Blueprints 2.4.0 - The variables
hdfs
andlocal
are available togremlin.sh -e
- Removed
SequenceFile
migration model via Faunus (unsustainable)
```xml
com.thinkaurelius.faunus
faunus
0.3.2
```
- Fixed a bug in
SchemaInferencerMapReduce
withEmptyConfiguration
- User can now provide their own
mapred.jar
to deploy to the Hadoop cluster - Graph database bulk loading is now fail safe as transaction size is the size of the input split
- Incremental bulk loading is now possible w/
loading-from-scratch=false
inBlueprintsGraphOutputMapReduce
- Number of reduce tasks to use during bulk and incremental loading is computed as a function of input split size
- Added more useful
PathFilter
instances - Generalized
HDFSTools
for more generic use throughout Faunus - Added
JobConfigurationFormat
which supports dynamic job manipulation at execution time - Bumped to support Titan 0.3.2
- Fixed a bug in
CountMapReduce
where a value of 0 was not displayed - Added
FileSystem.cp()
to support copying paths within a filesystem
```xml
com.thinkaurelius.faunus
faunus
0.3.1
```
- Optimized the default in-memory filter for
VertexQueryFilter
- Bumped to support Titan 0.3.1
- Updated all
ByteBuffers
toStaticByteBuffers
(and associated updates)
```xml
com.thinkaurelius.faunus
faunus
0.3.0
```
- Skipped Faunus 0.2.y line in order to synchronize with Titan 0.3.z
- Bumped to support Titan 0.3.0
- Bumped to support Hadoop 1.1.2
- Moved
IdentityMap
to base directory as changed in Pipes 2.3.0 - Refactored the means by which MapReduce test cases are written
FaunusVertex
property value serialization support using Kryo 2.2.1FaunusVertex01
provides history deserialization support for Faunus 0.1.z → Faunus 0.3.z conversion- Moved
util.MicroElement
classes into respectiveFaunusElement
inner classes - Added numerous test cases for file-based
RecordReader
formats - Added
ScriptInputFormat
to support the arbitrary reading of an HDFS file lines into aFaunusVertex
- Added
ScriptOutputFormat
to support the arbitrary writing of an HDFS file line from aFaunusVertex
- Introduced the
faunus.pipeline
property namespace for properties that are global to the pipeline - Added support for vertex-centric “filters” for isolating subsets of a graph
InputFormat
- Fixed a severe bug in the propagation of
Configurations
throughout a Faunus job sequence (EmptyConfiguration
) - Renamed
GraphSONUtility
toFaunusGraphSONUtility
to avoid name conflicts with Blueprints - Added
EdgeCopyMapReduce
to support graph transpose for bidirectional edges FaunusGraph
now implementsConfigurable
and thusgetConf()
instead ofgetConfiguration()
- Removed unnecessary combiner steps when the combiner only groups messages without compressing them
```xml
com.thinkaurelius.faunus
faunus
0.1.1
```
OrderMapReduce
semantics updated for number based properties (multiply by path count for efficiency)- Improved the memory consumption and speed of
MemoryMapper
- Bumped to support Hadoop 1.1.1
- Bumped to support Titan 0.2.1
- Bumped to support OpenRDF 2.6.10
- Bumped to support TinkerPop 2.3.0
- Added
script
-step which allows for arbitrary Gremlin/Groovy script executions on Faunus vertices - Removed
step
-step as it was not clearly defined and difficult to use (seescript
-step) - Fixed job jar finding issue by introducing
FAUNUS_HOME
system property - Complete refactoring of
FaunusCompiler
andFaunusPipeline
- Added support for user defined
Input
- andOutputFormats
- Added
MapReduceFormat
which allows an arbitrary number of MapReduce jobs after/before input/output - Added configuration to allow Faunus to get the actual vertex count from Rexster when determining split size
- Added configuration to allow Faunus to authenticate to Rexster, if Rexster security is turned on
- Removed dependency on
Tokens.F
with the inclusion of ordering in TinkerPop Pipes
```xml
com.thinkaurelius.faunus
faunus
0.1.0
```
- First release and therefore, no changes.