As far as I understand it can be done using VPBROADCASTD instruction.
Detailed information can be found in Intel Xeon Phi Coprocessor Instruction Set Architecture Reference Manual (http://download-software.intel.com/sites/default/files/forum/278102/327364001en.pdf), page 459.
Also you can use Intel C++ Compiler or Intel Fortran Compiler with automatic vectorization. In that case the compiler will care what SIMD instructions to use. Just use -O2 key in the compiler command line and #pragma simd before the loop. Of course, we need to care about simple enough loop structure and avoid data dependencies. Additional information can be found here: http://software.intel.com/mic-developer, training, advanced videos, vectorization.