The traditional random number generating functions in SAS – RANUNI, RANNOR and the rest, as well as UNIFORM and NORMAL – are now deprecated, and the advice is to use the relatively new RAND function instead. Why is this?
Here first is an example of the use of the traditional functions.
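A minimal sketch of such a DATA step is given below. The data set name, the loop count of 10 and the loop variable i are illustrative assumptions; the seeds 3 and 7 and the variables R1 and R2 follow the description in the text.

```sas
/* Sketch: the traditional seeded functions (names are illustrative) */
data work.old_random;
  do i = 1 to 10;
    r1 = ranuni(3);   /* uniform on (0,1), seed argument 3 */
    r2 = rannor(7);   /* standard normal, seed argument 7  */
    output;
  end;
  drop i;
run;
```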
Two streams of random numbers are being generated here, R1 being “uniformly” distributed and R2 “normally” distributed. The seed values – here 3 and 7, arbitrarily – are positive, which means that the sequences are reproducible: exactly the same “random” values would be generated every time the program was run. A seed that was zero or negative would instead initialise the generator from the system clock, so that a different sequence would be generated every time the program was run. Either way the numbers are pseudo-random, since they come from a deterministic algorithm; only the starting point changes.
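For comparison, a sketch along the following lines (data set name and row count assumed) would be expected to give different values on each run, because a seed of 0 tells RANUNI to start from the system clock.

```sas
/* Sketch: clock-based seed, so results are not reproducible */
data work.clock_seeded;
  do i = 1 to 5;
    r = ranuni(0);   /* seed 0: initialise from the time of day */
    output;
  end;
  drop i;
run;
```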
Random number sequences generated in this way are good but not perfect. They perform pretty well against standard tests of randomness, but there is a theoretical possibility of two sequences overlapping (so that the random variables would not be statistically independent). These functions all draw excerpts from the same extremely long random sequence, each starting at a different point within it, depending on the seed value. The sequence runs to at least 2^64 values, so the probability of overlap is in practice small enough not to be a worry for most purposes.
In general, there is no need to rush out and alter existing code that uses the old random number functions. SAS Institute do warn that those functions are not suitable for use with parallel and distributed processing, but in most other cases old code can be safely left as it is.
For new code, RAND is better. Here is an example.
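A minimal sketch of the RAND equivalent is given below. The data set name, the seed value 12345 and the loop count are arbitrary assumptions made for illustration.

```sas
/* Sketch: RAND with a single STREAMINIT call (names are illustrative) */
data work.new_random;
  call streaminit(12345);    /* set the seed once, before any RAND call */
  do i = 1 to 10;
    r1 = rand('UNIFORM');    /* uniform on (0,1)  */
    r2 = rand('NORMAL');     /* standard normal   */
    output;
  end;
  drop i;
run;
```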
The first parameter to the RAND function – and the only one specified here – is the name of the distribution to be followed by the random values. All of the distributions for which there were “old” random number functions are supported.
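Most distributions also accept further parameters after the name. As a hedged illustration, with the seed and parameter values chosen arbitrarily and the data set and variable names assumed:

```sas
/* Sketch: RAND with distribution parameters (values are arbitrary) */
data work.rand_params;
  call streaminit(2024);
  do i = 1 to 10;
    score = rand('NORMAL', 100, 15);   /* mean 100, standard deviation 15 */
    flag  = rand('BERNOULLI', 0.3);    /* 1 with probability 0.3, else 0  */
    count = rand('POISSON', 4);        /* Poisson with mean 4             */
    output;
  end;
  drop i;
run;
```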
Notice that the RAND calls do not specify a seed. The equivalent is done by the single call to STREAMINIT at the beginning. Note that:
The benefits of using the RAND function are:
CALL STREAMINIT permits a choice of random number generators. The default, suitable for most purposes, is called “MTHybrid” and described as “Hybrid 1998/2002 32-bit Mersenne twister”. Others available are described as 64-bit, “Threefry”, “Threefish” etc. (Perhaps with chips?)
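As a sketch of how a non-default generator might be selected: the example below assumes a release that accepts a generator name as the first argument to CALL STREAMINIT (believed to be SAS 9.4M5 or later) and uses 'MT64' for the 64-bit Mersenne twister. The seed and data set name are arbitrary.

```sas
/* Sketch: choosing a generator by name (assumes SAS 9.4M5 or later) */
data work.mt64_random;
  call streaminit('MT64', 12345);   /* generator name, then seed */
  do i = 1 to 10;
    r = rand('UNIFORM');
    output;
  end;
  drop i;
run;
```

If the generator name is omitted, the default described above is used.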
There are two related CALL routines, neither of which will normally be required: