Knowledge Platform

Play Framework APIs for the Sunbird Knowledge Platform. Each service exposes REST endpoints for managing content, taxonomy, search, and assessments, backed by JanusGraph, YugabyteDB, Elasticsearch, and Redis.

Modules

Module	Description
`platform-core`	Shared libraries: graph engine, schema validators, actors, cloud storage
`ontology-engine`	Graph operations for content, taxonomy, and assessment nodes
`content-api/content-service`	Content and collection CRUD, hierarchy, publishing triggers
`taxonomy-api/taxonomy-service`	Frameworks, categories, terms, channels, licenses
`search-api/search-service`	Composite search across the knowledge graph via Elasticsearch
`assessment-api/assessment-service`	QuestionSets and assessment items
`knowlg-service`	Aggregator service bundling Content, Taxonomy, and Assessment APIs into a single runtime
`platform-modules`	MIME-type management, URL management, and import utilities

Prerequisites

Make sure these are installed before you begin:

Java 11 — verify with java -version
Maven 3.9+ — verify with mvn -version
Docker Desktop — verify with docker --version
- Allocate at least 6 GB RAM to Docker Desktop (Settings > Resources > Memory). The default 3.8 GB is not enough.
Git — verify with git --version

Local Development Setup

Follow these steps in order. The full setup takes about 5 minutes.

Step 1 — Clone the repository

git clone https://github.com/Sunbird-Knowlg/knowledge-platform.git
cd knowledge-platform

Step 2 — Start infrastructure

cd docker
docker compose up -d

This starts YugabyteDB, JanusGraph, Elasticsearch, and Kafka.

Wait about 90 seconds for everything to initialize (YugabyteDB starts first, then JanusGraph connects to it and creates the graph schema). You can check progress with:

docker compose ps                  # all containers should show "Up"
docker logs janusgraph | grep "SCHEMA INITIALIZATION"
# Expected: --- SCHEMA INITIALIZATION COMPLETE ---

Step 3 — Initialize YugabyteDB keyspaces

Still inside the docker/ directory, run the migration script to create the required keyspaces and tables:

./init-yugabyte.sh

This downloads CQL migration files from sunbird-spark-installer and executes them. By default it uses dev as the keyspace prefix (e.g. dev_content_store) and the develop branch.

./init-yugabyte.sh sb           # use 'sb' as keyspace prefix instead
./init-yugabyte.sh dev main     # use a different branch

You only need to run this once. Run it again after docker compose down -v (which deletes volumes).

Step 4 — Initialize Elasticsearch indices

Still inside the docker/ directory, run the Elasticsearch init script to create the required indices and mappings:

./init-elasticsearch.sh

This downloads index and mapping definitions from sunbird-devops and applies them via the Elasticsearch REST API. By default it uses the release-8.0.0 branch.

./init-elasticsearch.sh release-9.0.0    # use a different branch

You only need to run this once. Run it again after docker compose down -v (which deletes volumes).

Step 5 — Build the project

Go back to the repository root and build:

cd ..
mvn clean install -DskipTests

This takes a few minutes the first time (Maven downloads dependencies). A successful build ends with BUILD SUCCESS.

To build for a specific cloud provider:

mvn clean install -DskipTests -Paws   # AWS S3
mvn clean install -DskipTests -Pgcp   # Google Cloud Storage
mvn clean install -DskipTests -Poci   # Oracle Cloud Infrastructure

Step 6 — Run a service

Required: Set cloud storage environment variables before starting any service. The StorageModule initializes eagerly on startup and the service will fail if the variables are empty. If you don't have real credentials, set placeholder values — storage will only fail when you actually upload/download content:
export cloud_storage_type=azure
export cloud_storage_key=placeholder
export cloud_storage_secret=placeholder
export cloud_storage_container=placeholder

You can either run services individually or run Content, Taxonomy, and Assessment together via knowlg-service.

Option A — Run an individual service

Service	Module Path	Default Port
Content Service	`content-api/content-service`	9000
Search Service	`search-api/search-service`	9000
Taxonomy Service	`taxonomy-api/taxonomy-service`	9000
Assessment Service	`assessment-api/assessment-service`	9000

Example — running Taxonomy Service:

Linux:

cd taxonomy-api/taxonomy-service
mvn play2:run

macOS:

cd taxonomy-api/taxonomy-service
mvn play2:dist
cd target
tar xvzf taxonomy-service-1.0-SNAPSHOT-dist.zip
cd taxonomy-service-1.0-SNAPSHOT
./start

Option B — Run Content, Taxonomy, and Assessment together

The knowlg-service module bundles Content, Taxonomy, and Assessment into a single Play application.

Linux:

cd knowlg-service
mvn play2:run

macOS:

cd knowlg-service
mvn play2:dist
cd target
tar xvzf knowlg-service-1.0-SNAPSHOT-dist.zip
cd knowlg-service-1.0-SNAPSHOT
./start

Verify it works

curl http://localhost:9000/health

You should get a 200 OK response.

Stopping and resetting

cd docker
docker compose down            # stop containers, keep data
docker compose down -v         # stop containers and delete all data

Redis (optional)

Redis is disabled by default. All service application.conf files ship with redis.enable = false, so the services read directly from the graph database.

To enable Redis caching:

Start Redis:

cd docker
docker compose --profile redis up -d

Set redis.enable = true in the application.conf of the service you are running.

Cloud Storage Configuration

Cloud storage is needed for uploading/downloading content artifacts. If you are only testing APIs that don't involve file uploads, you can skip this.

Set these environment variables before running a service:

Azure (default)

export cloud_storage_type=azure
export cloud_storage_auth_type=ACCESS_KEY
export cloud_storage_key=your-account-name
export cloud_storage_secret=your-account-key
export cloud_storage_container=your-container-name

AWS S3

export cloud_storage_type=aws
export cloud_storage_auth_type=ACCESS_KEY
export cloud_storage_key=your-access-key-id
export cloud_storage_secret=your-secret-access-key
export cloud_storage_region=ap-south-1
export cloud_storage_container=your-s3-bucket-name

Google Cloud Storage

export cloud_storage_type=gcloud
export cloud_storage_auth_type=ACCESS_KEY
export cloud_storage_key=your-client-email
export cloud_storage_secret=/path/to/key.json
export cloud_storage_container=your-gcs-bucket-name

CI/CD — GitHub Actions

The project uses GitHub Actions for CI/CD. Workflows are defined in .github/workflows/ and triggered on tag push.

Required variables (Settings > Secrets and variables > Actions)

Variable	Description
`REGISTRY_PROVIDER`	Registry type: `gcp`, `dockerhub`, `azure`, `aws`, or `ghcr`
`REGISTRY_URL`	Container registry URL
`CLOUD_STORE_GROUP_ID`	Cloud storage SDK group ID
`ARTIFACT_ID`	Cloud storage SDK artifact ID
`VERSION`	Cloud storage SDK version

Registry credentials

GitHub Container Registry (GHCR) — default, no setup needed. Uses the built-in GITHUB_TOKEN.

DockerHub

Secret	Example
`REGISTRY_USERNAME`	`myusername`
`REGISTRY_PASSWORD`	DockerHub password or access token
`REGISTRY_NAME`	`docker.io`

Azure Container Registry

Secret	Example
`REGISTRY_USERNAME`	ACR username
`REGISTRY_PASSWORD`	ACR password
`REGISTRY_NAME`	`myregistry.azurecr.io`

GCP Artifact Registry

Secret	Example
`GCP_SERVICE_ACCOUNT_KEY`	Base64-encoded service account JSON key
`REGISTRY_NAME`	`asia-south1-docker.pkg.dev`

Amazon ECR

Secret	Example
`AWS_ACCESS_KEY_ID`	AWS access key ID
`AWS_SECRET_ACCESS_KEY`	AWS secret access key
`AWS_REGION`	`us-east-1`

Name		Name	Last commit message	Last commit date
Latest commit History 3,605 Commits
.circleci		.circleci
.claude		.claude
.github		.github
assessment-api		assessment-api
build		build
content-api		content-api
docker		docker
functional-tests		functional-tests
knowlg-automation		knowlg-automation
knowlg-service		knowlg-service
kubernetes		kubernetes
ontology-engine		ontology-engine
platform-core		platform-core
platform-modules		platform-modules
schemas		schemas
scripts		scripts
search-api		search-api
taxonomy-api		taxonomy-api
taxonomy-service-sbt		taxonomy-service-sbt
test_schema		test_schema
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
IMPLEMENTATION_VERIFICATION.md		IMPLEMENTATION_VERIFICATION.md
KNOWLG-SETUP.md		KNOWLG-SETUP.md
LICENSE		LICENSE
README.md		README.md
knowlg-docker-image.sh		knowlg-docker-image.sh
pom.xml		pom.xml
vmsetup.sh		vmsetup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Knowledge Platform

Table of Contents

Modules

Prerequisites

Local Development Setup

Step 1 — Clone the repository

Step 2 — Start infrastructure

Step 3 — Initialize YugabyteDB keyspaces

Step 4 — Initialize Elasticsearch indices

Step 5 — Build the project

Step 6 — Run a service

Option A — Run an individual service

Option B — Run Content, Taxonomy, and Assessment together

Verify it works

Stopping and resetting

Redis (optional)

Cloud Storage Configuration

Azure (default)

AWS S3

Google Cloud Storage

CI/CD — GitHub Actions

Required variables (Settings > Secrets and variables > Actions)

Registry credentials

About

Uh oh!

Releases 32

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Knowledge Platform

Table of Contents

Modules

Prerequisites

Local Development Setup

Step 1 — Clone the repository

Step 2 — Start infrastructure

Step 3 — Initialize YugabyteDB keyspaces

Step 4 — Initialize Elasticsearch indices

Step 5 — Build the project

Step 6 — Run a service

Option A — Run an individual service

Option B — Run Content, Taxonomy, and Assessment together

Verify it works

Stopping and resetting

Redis (optional)

Cloud Storage Configuration

Azure (default)

AWS S3

Google Cloud Storage

CI/CD — GitHub Actions

Required variables (Settings > Secrets and variables > Actions)

Registry credentials

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 32

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages