Related Jira(s)

CPS-1415 - Getting issue details... STATUS

Description

Define scenarios which cause a CM Handle to go stale
Implement changes to support tracking of CM Handle Freshness/Staleness

What might trigger a cmhandle to go to STALE?

dmi plugin identifies that the device is no longer contactable
dmi plugin identifies that an underlying device manager managing the device (node) is out of sync with the device itself.

Requirements

Functional

#	Interface	Requirement	Additional Information
1	CPS-NCMP-I-01	A Rest endpoint to allow DMI Plugin Reregistration, A kafka interface for DMI Plugin to provide trust level state changes for a CM Handle	Reregistration is to reregister all CMHandles managed by a CMHandle. Kafka interface schema allows for CMHandle as id and trust level as only value in data
2	DMI-I-01	A Rest endpoint to trigger DMI Plugin Reregistration	Asynchronous interaction to trigger DMI Plugin to hit endpoint in CPS-NCMP-I-01 with reregistration

Error Handling

#	Error Scenario	Expected behavior
1	DMI Plugin goes down	CMHandles managed by that DMI have NONE trust level, when the DMI comes back up, a reregistration process occurs, CMHandles are individually assessed for trust level then.
2	Node goes down	DMI Plugin informs NCMP of the trust level state change. DMI will update on changes to a cmhandles trust level change.

Capabilities

re-registration, once a day, same requirement as first time registration
single node heart beat failures 30,000 / minutes per instances

Scope

Currently only supporting NONE and COMPLETE. PARTIAL and POOR may be added later as below.

Changes to DMI Registry Model

We will not be making changes to the DMI Registry Model, Public Properties of the CMHandle will be used where needed.

Triggers of CMHandle trust level change

DMI Availability

Ping every 30 seconds configurable or (every communication with DMI OOS)

Health check endpoint already exists

http://'$1'/manage/health/readiness

Hazelcast Map

When DMI comes back up, DMI does audit and provides list of Trustworthy CM Handles

Audit triggered by NCMP with list of CMHandles IDs and for DMI to reregister HTTP

Deltas: DMI handling, properties; Delete all under a DMI? Performance?

DMI Availability	Trust Level
Available	FULL
Unavailable	NONE

Device Heart Beat

Assumed functionality is that this will be defined by the DMI Plugin as NCMP does not communicate with device directly

Interface in NCMP for DMI Plugin to be able to tell when device HB has been lost?

10 minute limit should be configurable with 10 as default.

Probably event based

Elapsed time since last DMI Plugin lost Device HB	Trust Level
Less than 10 minutes	COMPLETE
Over 10 minutes	NONE

DMI-I-01

/health

/v1/ch/trustlevel

{

"Trust":

}

Reregistration

This process occurs when the DMI Plugin Availability is down and then comes back up.
NCMP makes a synchronous call to the DMI Plugin (New Audit Endpoint) to trigger a reregistration
DMI Plugin then reregisters its CMHandles with NCMP (new reregistaration Endpoint?)
NCMP then compares the CMHandles which are being reregistered with the CMHandles which already exist.
CMHandles which are in NCMP but not in DMI reregistration request are kept as trust level none
What happens if there is conflict between the old and new properties of a CMHandle, just take the new properties?
New CMHandles could be registered

Hazelcast for Trust Level

Map Trust level for DMI Plugins
Key: Dmi Name, Values: health check url, trust level

Set Trust level for untrustworthy CMHandles
Key: CmHandleId

When checking the trust level for a CMHandle first check the trust level of that CMHandle's DMI Plugin
If None return None
If Full check trust level for the CMHandle and return that

High Level Interactions

Interface	Name	Trigger	Description	Type	Endpoint or Topic
1	HealthCheck	30 second interval (configurable)	NCMP is to perform a health check against each of the DMI Plugins	REST	/health
2	Reregistration request	DMI Plugin has gone down and comes back up	NCMP makes a call to that DMI Plugin telling it to reregister	REST	TBD
3	Reregistration	DMI Plugin received a reregistration request	DMI Plugin makes a call to NCMP to reregister its CM Handles	REST	/v1/ch/reregistration
4	CMHandle trust level change	A CMHandle managed by DMI Plugin's trust level has changed	data contains {trustLevel: ENUM} event id is cmhandle id	Kafka	TBD
5	TrustLevel Request	Client Request	TrustLevel is to be returned based on the values in above Maps	REST	TBD

Space shortcuts

Page tree

Related Jira(s)

Description

Requirements

Functional

Error Handling

Capabilities

Scope

Changes to DMI Registry Model

Triggers of CMHandle trust level change

DMI Availability

Device Heart Beat

Reregistration

Hazelcast for Trust Level

High Level Interactions

Managing TrustLevels

Space shortcuts

Page tree

CPS-1415: CM Handle Connectivity Freshness/Staleness

Related Jira(s)

Description

Requirements

Functional

Error Handling

Capabilities

Scope

Changes to DMI Registry Model

Triggers of CMHandle trust level change

DMI Availability

Device Heart Beat

Reregistration

Hazelcast for Trust Level

High Level Interactions

Managing TrustLevels