Replica support in Participant

In the current implementation, ACM supports multi-participant with same supported element Type but different participantId, so they need different properties file.

In order to support replica, it needs to support multi-participant using same properties file.

Solution 1:

Add dynamic participantId support.

Changes in Participant:

UUID participantId will be generated in memory instead to fetch it in properties file.
cosumerGroup will be empty (kafka configuration): any participant will have unique Kafka queue.

Changes in ACM-runtime:

When participant go OFF_LINE:
- if there are compositions connected to that participant, ACM-runtime will find other ON_LINE participant with same supported element type;
- if other ON_LINE participant is present it will change the connection with all compositions and instance;
- after that, it will execute restart for all compositions and instances to the ON_LINE participant.
When receive a participant REGISTER:
- it will check if there are compositions connected to a OFF_LINE participant with same supported element type;
- if there are, it will change the connection with all compositions and instances to that new registered participant;
- after that it will execute restart for all compositions and instances changed.

Note:

In a scenario of high number of compositions, if participant is restarting it will be slow-down the restarting action: AC-runtime will send a message for each composition primed and instance deployed to the participant.
The restarting action, will be not necessary with the next solution.

Solution 2:

Add a no-sql database support for participant.

Changes in Participant:

Add client support for no-sql database
Add repository support
Refactor CacheProvider to support insert/update
Add Transactional support
Refactor Participants that are using own cache in memory (Policy Participant saves policy and policy type in memory)

Changes in docker/Kubernetes environment

Refactor CSIT to support no-sql database
Refactor performance and stability test to support no-sql database
Refactor OOM to support no-sql database

Changes in ACM-runtime:

Refactor restarting scenario to apply the restarting only for compositions and instances in transition

Database in ADP marketplace:

Distributed Coordinator ED (Etcd): Distributed systems use etcd as a consistent key-value store for configuration management, service discovery, and coordinating distributed work. Many organizations use etcd to implement production systems such as container schedulers, service discovery services, and distributed data storage.
Document Database PG (PostgreSql)
Key Value Database AG (Apache Geode): Apache Geode provides a database-like consistency model, reliable transaction processing and a shared-nothing architecture to maintain very low latency performance with high concurrency processing.(https://geode.apache.org/docs/guide/114/getting_started/intro_to_clients.html) or (https://docs.spring.io/spring-boot-data-geode-build/1.7.5/reference/html5/).
Key Value Database RD (Redis): RDB is NOT good if you need to minimize the chance of data loss in case Redis stops working.
Wide Column Database CD (Apache Cassandra)
Search Engine (Open Search): It supports Rest Api, https://opensearch.org/docs/latest/clients/java/

Space shortcuts

Page tree

Solution 1:

Changes in Participant:

Changes in ACM-runtime:

Note:

Solution 2:

Changes in Participant:

Changes in docker/Kubernetes environment

Changes in ACM-runtime:

Database in ADP marketplace: