Jelly-JVM – getting started for developers

If you don't want to code anything and only use Jelly with your Apache Jena/RDF4J application, see the dedicated guide about using Jelly-JVM as a plugin.

This guide explains a few of the basic functionalities of Jelly-JVM and how to use them in your code. The core of Jelly-JVM is written in Java, but the reactive streaming module for Apache Pekko is written entirely in Scala, along with unit and integration tests.

Quick start – Apache Jena

Depending on your RDF library of choice (Apache Jena, RDF4J, Titanium), you should import one of the dependencies: jelly-jena, jelly-rdf4j, jelly-titanium-rdf-api¹. In our examples we will use Jena, so let's add this to your build.sbt file:

MavenGradleSBT

pom.xml

<dependency>
    <groupId>eu.neverblink.jelly</groupId>
    <artifactId>jelly-jena</artifactId>
    <version>3.4.0</version>
</dependency>

build.gradle

dependencies {
    implementation "eu.neverblink.jelly:jelly-jena:${jellyVersion}"
}

build.sbt

lazy val jellyVersion = "3.4.0"
libraryDependencies ++= Seq(
  "eu.neverblink.jelly" % "jelly-jena" % jellyVersion,
)

Now you can serialize/deserialize Jelly data with Apache Jena. Jelly is fully integrated with Jena, so it should all just magically work. Here is a simple example of reading a .jelly file (in this case, a metadata file from RiverBench) with RIOT:

JavaScala

Deserialization example (Java)

import eu.neverblink.jelly.convert.jena.riot.JellyLanguage;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.riot.RDFDataMgr;

// Load an RDF graph from a Jelly file
Model model = RDFDataMgr.loadModel(
  "https://w3id.org/riverbench/v/2.0.1.jelly", 
  JellyLanguage.JELLY
);

// Print the size of the model
System.out.println("Loaded an RDF graph with " + model.size() + " triples");

Deserialization example (Scala 3)

import eu.neverblink.jelly.convert.jena.riot.*
import org.apache.jena.riot.RDFDataMgr

// Load an RDF graph from a Jelly file
val model = RDFDataMgr.loadModel(
  "https://w3id.org/riverbench/v/2.0.1.jelly", 
  JellyLanguage.JELLY
)
// Print the size of the model
println(s"Loaded an RDF graph with ${model.size} triples")

Serialization is just as easy:

JavaScala

Serialization example (Java)

import eu.neverblink.jelly.convert.jena.riot.JellyLanguage;
import org.apache.jena.rdf.model.Model;
import org.apache.jena.riot.RDFDataMgr;

import java.io.FileOutputStream;

// Omitted here: creating an RDF model.
// You can use the one from the previous example.

try (FileOutputStream out = new FileOutputStream("metadata.jelly")) {
  // Write the model to a Jelly file
  RDFDataMgr.write(out, model, JellyLanguage.JELLY);
  System.out.println("Saved the model to metadata.jelly");
}

Serialization example (Scala 3)

import eu.neverblink.jelly.convert.jena.riot.*
import org.apache.jena.riot.RDFDataMgr

import java.io.FileOutputStream
import scala.util.Using

// Omitted here: creating an RDF model.
// You can use the one from the previous example.

Using.resource(new FileOutputStream("metadata.jelly")) { out =>
  // Write the model to a Jelly file
  RDFDataMgr.write(out, model, JellyLanguage.JELLY)
  println("Saved the model to metadata.jelly")
}

Use Jelly-JVM with Apache Jena

Use Jelly-JVM with RDF4J

Use Jelly-JVM with Titanium RDF API

Quick start – Titanium RDF API

If you aren't using a big RDF library like Jena or RDF4J, the simplest way to get started is to use the Titanium RDF API:

MavenGradleSBT

pom.xml

<dependency>
    <groupId>eu.neverblink.jelly</groupId>
    <artifactId>jelly-titanium-rdf-api</artifactId>
    <version>3.4.0</version>
</dependency>

build.gradle

dependencies {
    implementation "eu.neverblink.jelly:jelly-titanium-rdf-api:${jellyVersion}"
}

build.sbt

lazy val jellyVersion = "3.4.0"
libraryDependencies ++= Seq(
  "eu.neverblink.jelly" % "jelly-titanium-rdf-api" % jellyVersion,
)

You can write a Jelly file like this, using the simple RdfQuadConsumer interface:

Titanium writer example (Java)

var writer = TitaniumJellyWriter.factory(outputStream);
writer.quad(subject, predicate, object, ...);

Where outputStream is a Java OutputStream hooked up to, for example, a file on disk.

And read it like this, pointing the reader to an RdfQuadConsumer:

Titanium reader example (Java)

var reader = TitaniumJellyReader.factory();
reader.parseAll(quadConsumer, inputStream);

In this way, you can simply convert between Jelly, JSON-LD, CBOR-LD, N-Quads and other libraries supporting the RdfQuadConsumer interface.

More on using Jelly-JVM with Titanium RDF API

RDF Pekko Streams

Now, the real power of Jelly lies in its streaming capabilities. Not only can it stream individual RDF triples/quads (this is called flat streaming), but it can also very effectively handle streams of RDF graphs or datasets. To work with streams, you need to use the jelly-pekko-stream module, which is based on the Apache Pekko Streams library. So, let's update our dependencies:

MavenGradleSBT

pom.xml

<dependency>
    <groupId>eu.neverblink.jelly</groupId>
    <artifactId>jelly-pekko-stream_3</artifactId>
    <version>3.4.0</version>
</dependency>

build.gradle

dependencies {
    implementation "eu.neverblink.jelly:jelly-pekko-stream_3:${jellyVersion}"
}

build.sbt

lazy val jellyVersion = "3.4.0"

libraryDependencies ++= Seq(
  "eu.neverblink.jelly" %% "jelly-jena" % jellyVersion,
  "eu.neverblink.jelly" %% "jelly-pekko-stream" % jellyVersion,
)

Now, let's say we have a stream of RDF graphs – for example each graph corresponds to one set of measurements from an IoT sensor. We want to have a stream that turns these graphs into their serialized representations (byte arrays), which we can then send over the network. Here is how to do it:

Apache Pekko Reactive streaming example (Scala 3)

// We need to import "jena.given" for Jena-to-Jelly conversions
import eu.neverblink.jelly.convert.jena.{JenaAdapters, JenaConverterFactory}
import eu.neverblink.jelly.convert.jena.riot.*
import eu.neverblink.jelly.core.JellyOptions
import eu.neverblink.jelly.stream.*
import org.apache.jena.riot.RDFDataMgr
import org.apache.pekko.actor.ActorSystem
import org.apache.pekko.stream.scaladsl.*

import scala.concurrent.ExecutionContext

// We will need a Pekko actor system to run the streams
given actorSystem: ActorSystem = ActorSystem()

// And an execution context for the futures
given ExecutionContext = actorSystem.getDispatcher

// We will need a JenaConverterFactory to convert between Jelly and Jena
given JenaConverterFactory = JenaConverterFactory.getInstance()

// We need to import the Jena adapters to turn Model/Dataset into a stream of statements
given JenaAdapters.DATASET_ADAPTER.type = JenaAdapters.DATASET_ADAPTER
given JenaAdapters.MODEL_ADAPTER.type = JenaAdapters.MODEL_ADAPTER

// Load an RDF graph for testing
val model = RDFDataMgr.loadModel(
  "https://w3id.org/riverbench/v/2.0.1.jelly", 
  JellyLanguage.JELLY
)

Source.repeat(model) // Create a stream of the same model over and over
  .take(10) // Take only the first 10 elements in the stream
  .flatMap(model => RdfSource.builder // Convert each model to a source of triples
    .graphAsTriples(model) 
    .source
  )
  .via(EncoderFlow.graphStream( // Encode each iterable to a Jelly stream frame
    maybeLimiter = None, // 1 RDF graph = 1 message
    JellyOptions.smallStrict, // Jelly compression settings preset
  ))
  .via(JellyIo.toBytes) // Convert the stream frames to a byte arrays
  .runForeach { bytes =>
    // Just print the length of each byte array in the stream.
    // You can also hook this up to MQTT, Kafka, etc.
    println(s"Streamed ${bytes.length} bytes")
  }
  .onComplete(_ => actorSystem.terminate())

Jelly will compress this stream on-the-fly, so if the data is repetitive, it will be very efficient. If you run this code, you will notice that the byte sizes for the later graphs are smaller, even though we are sending the same graph over and over again. But, even if each graph is completely different, Jelly still should be much faster than other serialization formats.

These streams are very powerful, because they are reactive and asynchronous – in short, this means you can hook this up to any data source and any data sink – and you can scale it up as much as you want. If you are unfamiliar with the concept of reactive streams, we recommend you start with this Apache Pekko Streams guide.

Jelly-JVM supports streaming serialization and deserialization of all types of streams in the RDF Stream Taxonomy. You can read more about the theory of this and all available stream types in the Jelly protocol documentation.

Learn more about reactive streaming with Jelly-JVM

Learn more about the types of streams in Jelly

gRPC streaming

Jelly is a bit more than just a serialization format – it also defines a gRPC-based straming protocol. You can use it for streaming RDF data between microservices, to build a pub/sub system, or to publish RDF data to the web.

Learn more about using Jelly gRPC protocol servers and clients

Example applications using Jelly-JVM

The examples directory in the Jelly-JVM repo contains code snippets that demonstrate how to use the library in various scenarios.
jelly-cli command-line utility can help you convert to/from Jelly, as well as validate and debug Jelly files.
Nanopub Registry and Query are production applications of Jelly. They use Jelly-JVM for inter-service communication, using the RDF4J integration.
RiverBench ci-worker – a real-world application that is used for processing large RDF datasets in a CI/CD pipeline. It uses Jelly-JVM for serialization and deserialization with Apache Jena. It also uses extensively Apache Pekko Streams.
Jelly JVM benchmarks – research software for testing the performance of Jelly-JVM and other RDF serializations in Apache Jena. It uses most of Jelly-JVM's features.

Questions?

If you have any questions about using Jelly-JVM, feel free to open an issue on GitHub.

There is nothing stopping you from using more than one at the same time. You can also pretty easily add support for any other Java-based RDF library by implementing a few interfaces. More details here. ↩

Jelly-JVM – getting started for developers

Quick start – Apache Jena

Quick start – Titanium RDF API

RDF Pekko Streams

gRPC streaming

Further reading

Example applications using Jelly-JVM

Questions?