Available options

Available options
Prev	#OPTION	Next

The following options are generally useful:

maxRunTime	Default: none	Sets the maximum number of seconds a job runs before it times out
freezePersists	Default: false	If true, does not calculate/recalculate PERSISTed
expirePersists	Default: true	If true, PERSISTs expire after the specified period. This is set in the Sasha configuration setting (PersistExpiryDefault) or using #option ('defaultPersistExpiry', n) where n is the number of days.
defaultPersistExpiry	Default: none	If set, PERSISTs expire after the number of days specified (overriding the Sasha PersistExpiryDefault setting).
multiplePersistInstances	Default: true	If true, multiple PERSISTs are the default.
defaultNumPersistInstances	Default: none	Specifies the default number of PERSISTs. A value of -1 specifies that all copies should be kept until they expire or manually deleted.
check	Default: true	If true, check for potential overflows of records.
expandRepeatAnyAsDfa	Default: true	If true, expand ANY* in a DFA.
forceFakeThor	Default: false	If true, force code to use hthor.
forceGenerate	Default: false	If true, force .SO to be generated even if it's not worth it
globalFold	Default: true	If true, perform a global constant fold before generating.
globalOptimize	Default: false	If true, perform a global optimize.
groupAllDistribute	Default: false	If true, GROUP,ALL generates a DISTRIBUTE instead of a global SORT.
maximizeLexer	Default: false	If true, maximize the amount of work done in the lexer.
maxLength	Default: 4096	Specify maximum length of a record.
minimizeSpillSize	Default: false	If true, if a spill is filtered/deduped etc when read, reduce spill file size by splitting, filtering and then writing.
optimizeGraph	Default: true	If true, optimize expressions in a graph before generation
orderDiskFunnel	Default: true	If true, if all inputs to a funnel are disk reads, pull in
parseDfaComplexity	Default: 2000	Maximum complexity of expression to convert to a DFA.
pickBestEngine	Default: true	If true, use hthor if it is more efficient than Thor
diskReadsAreSimple	Default: true	If true, modifies the behavior of the pickBestEngine option so disk read operations are regarded the same as index read operations when deciding whether Thor is needed. The benefit is that simple jobs can run on hthor reading/filtering data remotely using dafilesrv.
targetClusterType	hthor\|Thor\|roxie	What supercomputer type are we generating code for?
topnLimit	Default: 10000	Maximum number of records to do topN on.
outputLimit	Default: 10	Sets maximum size (in Mb) of result stored in workunit.
sortIndexPayload	Default: true	Specifies sorting (or not) payload fields
workflow	Default: true	Specifies enabling/disabling workflow services.
foldStored	Default: false	Specifies that all the stored variables are replaced with their default values, or values overridden by #stored. This can significantly reduce the size of the graph generated.
skipFileFormatCrcCheck	Default: false	Specifies that the CRC check on indices produces a warning and not an error.
allowedClusters	Default: none	Specifies the comma-delimited list of cluster names (as a string constant) where the workunit may execute. This allows the job to be switched between clusters, manually or automatically, if the workunit is blocked on its assigned cluster and another valid cluster is available for use.
AllowAutoQueueSwitch	Default: false	If true, specifies that the workunit is automatically re-assigned to execute on another available cluster listed in allowedClusters when blocked on its assigned cluster.
performWorkflowCse	Default: false	If true, specifies that the code generator automatically detects opportunities for Common Sub-expression Elimination that may be "buried" within multiple PERSISTed attributes. If false, notification of these opportunities are displayed to the programmer as suggestions for the use of the INDEPENDENT Workflow Service.
defaultSkewError	Default: none	A value between 0.0 and 1.0 that determines the amount of skew needed to generate a skew error. This value is ignored if the ECL has provided a SKEW attribute.
defaultSkewWarning	Default: none	A value between 0.0 and 1.0 that determines the amount of skew needed to generate a skew warning. If set higher than defaultSkewError, then the value is ignored.
overrideSkewError	Default: none	If set to a value between 0.0 and 1.0, it overrides any ECL SKEW(nn) attribute values in the current job.
defaultSkewThreshold	Default: 1GB	The size of the dataset (in bytes) local to a single node needed before Skew errors/warnings are generated if no THRESHOLD(nn) was supplied in ECL.
overrideSkewThreshold	Default: none	The size of the dataset (in bytes) local to a single node needed before Skew errors/warnings are generated. Overrides any ECL THRESHOLD(nn) attribute values in the current job.
applyInstantEclTransformations	Default false	Limit non-file outputs with a CHOOSEN
applyInstantEclTransformationsLimit	Default 100	Number of records to limit to
divideByZero	Default zero	'zero' evaluates to 0, the default behavior. 'fail' causes the job to fail and report a division by zero error. 'nan' (only currently supported for real numbers) creates a quiet NaN, which will propagate through any real expressions it is used in. You can use NOT ISVALID(x) to test if the value is a NaN. Integer and decimal division by zero continue to return 0.
outputLimitMb	Default 10 [MB]	Limit of output to a workunit in MB.
hthorMemoryLimit	Default 300 [MB]	Override memory usage limit set in ECL Agent's defaultMemoryLimitMB configuration option (for hThor only).
maxCsvRowSizeMb	Default 10 [MB]	Upper limit of a CSV line read in MB.
validateFileType	Default true	If false, the engines use the definition in the ECL workunit and ignore the file type from the logical file meta data. If true, this check is always ignored if the ECL is reading a CSV or a fixed record width flat file. Also when true, if the ECL is reading XML or JSON, and there is a mismatch, it issues a warning not an error.
compressInternalSpills	Default true	Compress internal spills. (e.g., spills created by lookahead or sort gathering).
hdCompressorType	Default 'FLZ'	Distribute compressor to use.
hdCompressorOptions	Default ''	Distribute compressor options (e.g., AES key)
splitterSpill	Default -1	Integer value to indicate whether to force splitters to spill or not. [1 = force spill \| 0 = force in memory \| -1 = adhere to helper setting ]
loopMaxEmpty	Default 1000	Max # of iterations that LOOP can cycle through without results before reporting an error
smallSortThreshold	Default 0 (disabled)	If estimated size is below this threshold in bytes, a minisort approach should be used.
sort_max_deviance	Default 10 [MB]	Max (byte) variance allowed during sort partitioning
joinHelperThreads	Default = same as number of cores	Number of threads to use in threaded variety of join helper
bindCores	Default = 0	For Roxie queries. If non-zero, binds the query to only use the specified number of cores. This overrides the value set for coresPerQuery in Roxie configuration.
translateDFSlayouts	Default = 0	Specifies that file layouts should be looked up at compile time. See File Layout Resolution at Compile Time in the Programmer's Guide for more details.
timeLimit		For Roxie queries. Maximum run time (in ms) for a query.
generateGlobalId	Default = false	For Roxie queries. When true, generates a unique GlobalId if one is not provided.
analyzeWorkunit		Overrides the setting in ECL Agent to analyze workunits after ECL queries are executed (Thor only). This allows a workunit to be further analyzed to identify and display any potential issues. These possible issues display in ECL Watch's "Warnings & Errors" area. The global setting defaults to TRUE, but can be changed using Configuration Manager.
maxCost	Default: none	Overrides the limit setting in Thor's configuration. If the maxCost threshold is reached, the job guillotine is enforced and the job is halted. This does not override the hardlimit setting. This is only valid for Thor jobs.

The following options are all about generating Logical graphs in a workunit.

Logical graphs are stored in the workunit and viewed in ECL Watch. They include information about which attribute/line number/column the symbols are defined in. Exported attributes are represented by <module>.<attribute> in the header of the activity. Non-exported (local) attributes are represented as <module>.<exported-attribute>::<non-exported-name>

generateLogicalGraph	Default: false	If true, generates a Logical graph in addition to all the workunit graphs.
generateLogicalGraphOnly	Default: false	If true, generates only the Logical graph for the workunit.
logicalGraphExpandPersist	Default: true	If true, generates expands PERSISTed attributes.
logicalGraphExpandStored	Default: false	If true, generates expands STORED attributes.
logicalGraphIncludeName	Default: true	If true, generates attribute names in the header of the activity boxes.
logicalGraphIncludeModule	Default: true	If true, generates module.attribute names in the header of the activity boxes.
logicalGraphDisplayJavadoc	Default: true	If true, generates the Javadoc-style comments embedded in the ECL in place of the standard text that would be generated (see http://java.sun.com/j2se/javadoc/writingdoccomments/). Javadoc-style comments on RECORD structures or scalar attributes will not generate, as they have no graph Activity box directly associated.
logicalGraphDisplayJavadocParameters	Default: false	If true, generates information about parameters in any Javadoc-style comments.
filteredReadSpillThreshold	Default: 2	Filtered disk reads are spilled if will be duplicated more than N times.
foldConstantCast	Default: true	If true, (cast)value is folded at generate time.
foldFilter	Default: true	If true, filters are constant folded.
foldAssign	Default: true	If true, TRANSFORMs are constant folded.
foldSQL	Default: true	If true, SQL is constant folded.
optimizeDiskRead	Default: true	If true, include project and filter in the transform for a disk read.
optimizeSQL	Default: false	If true, optimize SQL.
optimizeThorCounts	Default: true	If true, convert COUNT(diskfile) into optimized version.
peephole	Default: true	If true, peephole optimize memcpy/memsets, etc.
spotCSE	Default: true	If true, look for common sub-expressions in TRANSFORMs/filters.
noteRecordSizeInGraph	Default: true	Add estimates of record sizes to the graph
showActivitySizeInGraph	Default: false	Show estimates of generated C++ size in the graph
showMetaInGraph	Default: false	Add distribution/sort orders to the graph
showRecordCountInGraph	Default: true	Show estimates of record counts in the graph

spotTopN	Default: true	If true, convert CHOOSEN(SORT()) into a topN activity.
spotLocalMerge	Default: false	If true, if local JOIN and both sides are sorted, generate a light-weight merge.
countIndex	Default: false	If true, optimize COUNT(index) into optimized version (also requires optimizeThorCounts).
allowThroughSpill	Default: true	If true, allow through spills.
optimizeBoolReturn	Default: true	If true, improve code when returning BOOLEAN from a function.
optimizeSubString	Default: true	If true, don't allocate memory when doing a substring.
thorKeys	Default: true	If true, allow INDEX operations in Thor.
regexVersion	Default: 0	If set to 1, specifies use of the previous regular expression implementation, which may be faster but also may exceed stack limits.
compileOptions	Default: none	Specify override compiler options (such as /Zm1000 to double the compiler heap size to workaround a heap overflow error).
linkOptions	Default: none	Specify override linker options.
optimizeProjects	Default: true	If false, disables automatic field projection/distribution optimization.
notifyOptimizedProjects	Default: 0	If set to 1, reports optimizations to named attributes. If set to 2, reports all optimizations.
optimizeProjectsPreservePersists	Default: false	If true, disables automatic field projection/distribution optimization around reading PERSISTed files. If a PERSISTed file is read on a different size cluster than it was created on, optimizing the projected fields can mean that the distribution/sort order cannot be recreated.
aggressiveOptimizeProjects	Default: false	If true, enables attempted minimization of network traffic for sorts/distributes. This option doesn't usually result in significant benefits, but may do so in some specific cases.
percolateConstants	Default: true	If false, disables attempted aggressive constant value optimizations.

The following options are useful for debugging:

debugNlp	Default: false	If true, output debug information about the NLP processing to the .cpp file.
resourceMaxMemory	Default: 400M	Maximum amount of memory a subgraph can use.
resourceMaxSockets	Default: 2000	Maximum number of sockets a subgraph can use.
resourceMaxActivities	Default: 200	Maximum number of activities a subgraph can contain.
unlimitedResources	Default: false	If true, assume lots of resources when resourcing the graphs.
traceRowXML	Default: false	If true, turns on tracing in ECL Watch graphs. This should only be used with small datasets for debugging purposes.
_Probe	Default: false	If true, display all result rows from intermediate result sets in the graph in ECL Watch when used in conjunction with the traceRowXML option. This should only be used with small datasets for debugging purposes.
debugQuery	Default: false	If true, compile query using debug settings.
optimizeLevel	Default: 3 for roxie, else 0	Set the C++ compiler optimization level (optimizations can cause the compiler to take a lot longer).
checkAsserts	Default: true	If true, enables ASSERT checking.
soapTraceLevel	Default: 1	The level of detail in reporting SOAPCALL or HTTPCALL information (set to 0 for none, 1 for normal, 2 - 8 for more detail)
traceEnabled	Default: FALSE	Enables tracing to log files when TRACE actions are present. See TRACE.
traceLimit	Default: 10	Overrides the the default KEEP setting for a TRACE statement to indicate how many TRACE statement to write to log file. See TRACE.
maxlogdetail		Overrides the the default logging level for a single workunit. This allows logging levels to be set to a low level by default, but allow jobs to be resubmitted with a higher logging level for investigation.

The following options are for advanced code generation use:

These options should be left alone unless you REALLY know what you are doing. Typically they are used internally by our developers to enable/disable features that are still in development. Occasionally the technical support staff will suggest that you change one of these settings to work around a problem that you encounter, but otherwise the default settings are recommended in all cases.

filteredReadSpillThreshold	Default: 2	Filtered disk reads are spilled if will be duplicated more than N times.
foldConstantCast	Default: true	If true, (cast)value is folded at generate time.
foldFilter	Default: true	If true, filters are constant folded.
foldAssign	Default: true	If true, TRANSFORMs are constant folded.
foldSQL	Default: true	If true, SQL is constant folded.
optimizeDiskRead	Default: true	If true, include project and filter in the transform for a disk read.
optimizeSQL	Default: false	If true, optimize SQL.
optimizeThorCounts	Default: true	If true, convert COUNT(diskfile) into optimized version.
peephole	Default: true	If true, peephole optimize memcpy/memsets, etc.
spotCSE	Default: true	If true, look for common sub-expressions in TRANSFORMs/filters.
spotTopN	Default: true	If true, convert CHOOSEN(SORT()) into a topN activity.
spotLocalMerge	Default: false	If true, if local JOIN and both sides are sorted, generate a light-weight merge.
countIndex	Default: false	If true, optimize COUNT(index) into optimized version (also requires optimizeThorCounts).
allowThroughSpill	Default: true	If true, allow through spills.
optimizeBoolReturn	Default: true	If true, improve code when returning BOOLEAN from a function.
optimizeSubString	Default: true	If true, don't allocate memory when doing a substring.
thorKeys	Default: true	If true, allow INDEX operations in thor.
regexVersion	Default: 0	If set to 1, specifies use of the previous regular expression implementation, which may be faster but also may exceed stack limits.
compileOptions	Default: none	Specify override compiler options (such as /Zm1000 to double the compiler heap size to workaround a heap overflow error).
linkOptions	Default: none	Specify override linker options.
optimizeProjects	Default: true	If false, disables automatic field projection/distribution optimization.
notifyOptimizedProjects	Default: 0	If set to 1, reports optimizations to named attributes. If set to 2, reports all optimizations.
optimizeProjectsPreservePersists	Default: false	If true, disables automatic field projection/distribution optimization around reading PERSISTed files. If a PERSISTed file is read on a different size cluster than it was created on, optimizing the projected fields can mean that the distribution/sort order cannot be recreated.
aggressiveOptimizeProjects	Default: false	If true, enables attempted minimization of network traffic for sorts/distributes. This option doesn't usually result in significant benefits, but may do so in some specific cases.
percolateConstants	Default: true	If false, disables attempted aggressive constant value optimizations.
exportDependencies	Default: false	Generate information about inter-definition dependencies
maxCompileThreads	Default 4 for eclccserver and 1 for eclcc	Number of compiler instances to compile the C++
reportCppWarnings	Default: false	Report warnings from C++ compilation
saveCppTempFiles	Default: false	Retain the generated C++ files
spanMultipleCpp	Default: true	Generate a work unit in multiple C++ files
activitiesPerCpp	Default 500 for Linux or 800 for Windows	Number of activities in each C++ file (requires spanMultipleCpp)
obfuscateOutput	Default false	If true, details are removed from the generated workunit, including ECL code, estimates of record size, and number of records.

The following options are for the workunit analyzer:

analyzeWorkunit	Default: true	If set to FALSE, disables analysis of the workunit
analyzer_minInterestingTime	Default: 1000	Analyze activities that exceed this minimum time to execute (milliseconds)
analyzer_minInterestingCost	Default: 30000	Report issues where the time penalty exceeds this value (milliseconds)
analyzer_skewThreshold	Default: 20	Report skew related issues that exceed this threshold
analyzer_minRowsPerNode	Default: 1000	Ignore activities that have this average number of rows per node

Examples:

#OPTION('traceRowXml', TRUE);
#OPTION('_Probe', TRUE);

my_rec := RECORD
  STRING20 lname;
  STRING20 fname;
  STRING2 age;
END;
  
d := DATASET([{ 'PORTLY', 'STUART' , '39'},
              { 'PORTLY', 'STACIE' , '36'},
              { 'PORTLY', 'DARA' , ' 1'},
              { 'PORTLY', 'GARRETT', ' 4'}], my_rec);
  
OUTPUT(d(d.age > ' 1'), {lname, fname, age} );

//************************************
//This example demonstrates Logical Graphs and
// Javadoc-style comment blocks
#OPTION('generateLogicalGraphOnly',TRUE);
#OPTION('logicalGraphDisplayJavadocParameters',TRUE);
/**
 * Defines a record that contains information about a person
*/
namesRecord := RECORD
  string20    surname;
  string10    forename;
  integer2    age := 25;
END;
  
/**
Defines a table that can be used to read the information from the file
and then do something with it.
*/
namesTable := DATASET('x',namesRecord,FLAT);
  
  
/**
 Allows the name table to be filtered.
 @param ages        The ages that are allowed to be processed.
 @param badForename Forname to avoid.
 @return the filtered dataset.
*/
namesTable filtered(SET OF INTEGER2 ages, STRING badForename) :=
                    namesTable(age in ages, forename != badForename);
OUTPUT(filtered([10,20,33], ''));