This documentation is for WSO2 Stream Processor 4.2.0. View documentation for the latest release.

All docs This doc
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

A Siddhi Application is a combination of multiple Siddhi executional elements. A Siddhi executional element can be a Siddhi Query or a Siddhi Partition. When defining a Siddhi application, you can specify a number of parallel instances to be created for each executional element, and how each executional element must be isolated for an SP instance. Based on this, the initial Siddhi application is divided into multiple Siddhi applications and deployed in different SP instances.

Any standalone Siddhi Application can be converted in to a distributed Siddhi Application by adding @dist annotations. By adding these annotations, you ca divide the Siddhi Application to multiple execution groups and run them in parallel with multiple instances per group.

Annotations

The supported annotations are as follows:

Execution group annotation

Syntax@dist(execGroup=’name of the group<string>’)
DescriptionThis annotation specifies the execution groups. An execution group is a collection of queries that is executed as a single unit. You can add this annotation at the query level and specify a name for the group. Queries with same execution group name are considered as part of the same group. If you do not specify an execution group name, system generated name with the {Siddhi app name}-random UUID format is assigned. When grouping queries, the following scenarios result in validation exceptions and application deployment is halted as a result.
  • The same window being referenced from two different execution groups.
  • The same in-memory table being referenced from two different execution groups.
  • Two partitions with the same key residing within the same execution group.
  • The same stream being referenced as an unpartitioned stream within a partition and also in a normal query within a same execution group.
Applicable LevelQuery
Example
@info(name = ‘query-1')
@dist(execGroup='group-1')
from TempStream#window.time(2 min)
select avg(temp) as avgTemp, roomNo, deviceID
insert all events into AvgTempStream;

Parallel annotation

Syntax@dist(parallel=’no of parallel instances<int>’)
Description

This annotation specifies the execution parallelism of an execution group (i.e., the number of instances in which the executional elements of the execution group must be executed in parallel). Parallelism is always defined against an execution group. If no parallelism is specified, 1 is considered default. If specified, the number of instances of the execution group is created, and all these instances work in parallel. When defining parallelism, the following scenarios result in validation exceptions and application deployment is halted as a result.

  • A defined window is referenced within an execution group of which the parallelism is greater than 1.

  • An in-memory table is referenced within an execution group of which the parallelism is greater than 1.

  • A window query is included within an execution group of which the parallelism is greater than 1.

  • A pattern query within an execution group of which the parallelism is greater than 1.

  • A sequence query within an execution group of which the parallelism is greater than 1.

  • A join query within an execution group of which the parallelism is greater than 1.

  • Window within a partition query where window is not defined against the partitioned stream or the inner stream that belongs to an execution group of which the parallelism is greater than 1.

  • If different parallelism numbers are specified within the same group, application deployment is aborted with a validation exception.

Applicable LevelQuery
Example
@info(name = ‘query-3')
@dist(parallel='2')
from TempStream [(roomNo >= 100 and roomNo < 110) and temp > 40 ]
select roomNo, temp
insert into HighTempStream;

Transport channel creation annotation

Syntax@App:transportChannelCreationEnabled('true|false')
DescriptionThis annotation specifies whether Stream Processor managers are allowed to create Kafka topics that are required for application deployment. By default, this is set to true and the topics are created when the application is deployed. If the annotation is set to false, you need to create the required intermediate topics before deploying the application. If the required topics do not exist when you deploy the application, the application deployment fails, and the deployment process is aborted.
Applicable LevelApplication
Example
@App:name('wso2-app')  @App:transportChannelCreationEnabled('false')	

User given source parellelism annotation

Syntax@dist(parallel='no of parallel receiver instances<int>’)
DescriptionThis specifies the number of parallel receiver instances that should be created for a user given source. In a distributed deployment, user given sources are extracted out as separate passthrough Siddhi applications and deployed on designated receiver nodes. If adequate receiver nodes are not available, the deployment is aborted. If you have not specified a parallel count, it is defaulted to 1.  
Applicable LevelStream
Example
@source(type='inMemory', topic='stock', @map(type='json'), @dist(parallel='2'))


Example

The following is a sample distributed Siddhi application.

@App:name('wso2-app')

@Source(type=’http’,@map(type=’json),@dist(parallel=’3’))
Define stream TempStream(deviceID long, roomNo int, temp double);

@Source(type=’http’,@map(type=’json),@dist(parallel=’3’))
Define stream RegulatorStream(deviceID long, roomNo int, isOn bool);

@info(name = ‘query-1') @dist(execGroup='group-1')
from TempStream#window.time(2 min)
select avg(temp) as avgTemp, roomNo, deviceID insert all events into AvgTempStream;
 
@info(name = ‘query-2') @dist(execGroup='group-1')
from TempStream[temp > 30.0]#window.time(1 min) as T join
RegulatorStream[isOn ==false]#window.length(1) as R on T.roomNo == R.roomNo
select T.roomNo, R.deviceID, 'start' as action insert into RegulatorActionStream;
 
@info(name = ‘query-3') @dist(execGroup='group-1')
from every( e1=TempStream ) -> e2=TempStream[e1.roomNo==roomNo and (e1.temp + 5) <= temp ] within 10 min select e1.roomNo, e1.temp as initialTemp, e2.temp as finalTemp
insert into AlertStream;
 
@info(name = ‘query-4') @dist(execGroup='group-2' ,parallel ='3')
from TempStream [(roomNo >= 100 and roomNo < 110) and temp > 40 ] select roomNo, temp insert into HighTempStream;
 
@info(name = ‘query-5') @dist(execGroup='group-3' , parallel=’2’)
partition with ( deviceID of TempStream )
begin
    from TempStream#window.time(1 min)
    select roomNo, deviceID, temp, avg(temp) as avgTemp
    insert into #AvgTempStream;
 
    from #AvgTempStream[avgTemp > 20]#window.length(10)
    select roomNo, deviceID, max(temp) as maxTemp
    insert into deviceTempStream;
end;



When this Siddhi application is deployed, it is executed as shown in the table below.

Execution Group

Number of Siddhi

Application Instances

Queries executed

group1

1

query1

query2

query3

group2

2

query4

group3

2query5
  • No labels