Optimization

Expected performance gain ... 5960X vs Xeon Phi?

Hello... 
I am a retired theoretical physical chemist with a long association with computers and computing.
As briefly as possible, my interests are in the behavior of fluids at a phase boundary, such as a real gas at a solid
surface: the attractive forces of the solid cause an increased concentration (density) of the gas in the region near the surface, 
a measurable phenomenon called "adsorption". Thermodynamics requires that, at equilibrium at a constant temperature and

Poor MKL Dfti complex to complex performance

Hello,

I'm new to MIC programming and trying to get a grip on how to do things with the beast. I stumbled across very bad FFT performance (using a matrix size often used at our institution) for DFTI complex-to-complex transforms. In the following, no OMP, KMP, or MKL variables are set, except where stated. Setting the number of threads or specifying the placement does not change much for this comparison: the MIC is much slower than the host!

Any hints how to improve the situation?

Sincerely,

HC

Naive Hardware Configuration Question.

Yet another naive question. If I establish 2 compute nodes in my sandbox, am I generally better off with a MIC and 2 GPGPUs per node? I'm guessing the answer is "it depends"... But assuming that the MICs leverage the vector processing in the GPUs, then PCI seems like less of a bottleneck than QDR. My googling isn't showing big boxes with Frankenstein nodes, but in my empty head it seems like a good idea.

Thanks again Robert

Better than linear scaling

Hi,

I'm currently wondering about the scaling of my application with the number of cores. Basically, I'm getting a 70x speedup with 56 cores compared to a single core. However, the whole plot (see attached image) still looks more or less like a line, but with a slope of about 1.2. I'm running this application with thread numbers that are multiples of 4 and KMP_AFFINITY set to compact. What could possibly explain my curve?
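
For reference, the figures quoted above can be turned into a parallel-efficiency number. This is a minimal sketch of that arithmetic, not something from the original post: the function name `parallel_efficiency` is hypothetical, and only the 70x speedup and the 56 cores are taken from the post.

```python
def parallel_efficiency(speedup: float, cores: int) -> float:
    """Return speedup divided by core count (1.0 means ideal linear scaling)."""
    if cores <= 0:
        raise ValueError("cores must be positive")
    return speedup / cores


# Figures quoted in the post: 70x speedup on 56 cores.
efficiency = parallel_efficiency(70.0, 56)

# Values above 1.0 indicate better-than-linear (superlinear) scaling.
print(f"efficiency = {efficiency:.2f}")
```

An efficiency of 70 / 56 = 1.25 is in line with the slope of about 1.2 reported for the whole curve.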

The Intel® Core™ M Processor

This article, aimed at developers, will provide a glimpse into this 64-bit, multi-core SOC processor, with an overview of the available Intel technologies, including Intel® HD Graphics 5300.
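  • Developers
  • Partners
  • Professors
  • Students
  • Android*
  • Google Chrome OS*
  • Microsoft Windows* 8
  • Business Client
  • Intel® RealSense™ Technology
  • Windows*
  • Core
  • m
  • Low Power
  • Graphics
  • Intel® Core™ Processors
  • Intel® vPro™ Technology
  • Microsoft Windows* 8 Desktop
  • Mobility
  • Optimization
  • Power Efficiency
  • Security
  • Small Business
  • Touch Interfaces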

Encoder and VPP setting for 720p50 encoding

Hello,

My application generates 50 progressive frames (resolution 1280 * 720) per second.

I want to encode these frames using the IQSV encoder in 720p50 mode (1280*720, 50 frames per second).

What should the encoder and VPP settings be?

I also want to know whether we have to pass only 25 progressive frames to the encoder, or whether we can pass all 50 frames.

Currently, if I send all 50 frames for encoding, the output file's duration is double the duration we recorded.

If I send every other progressive frame to the encoder, the output file is correct.
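
The doubled duration described above is what simple frame-count arithmetic would predict if the stream were being timed at 25 frames per second. That is an assumption, not something the post confirms, and the helper name `playback_duration_s` and the 10-second recording length are hypothetical. A minimal sketch:

```python
from fractions import Fraction


def playback_duration_s(frame_count: int, fps_num: int, fps_den: int = 1) -> Fraction:
    """Duration in seconds of frame_count frames timed at fps_num/fps_den fps."""
    if fps_num <= 0 or fps_den <= 0:
        raise ValueError("frame rate must be positive")
    return Fraction(frame_count * fps_den, fps_num)


recorded_s = 10             # hypothetical 10-second recording
frames = 50 * recorded_s    # the application produces 50 frames per second

# All frames timed at 50 fps: duration matches the recording.
print(playback_duration_s(frames, 50))

# All frames timed at 25 fps (assumed misconfiguration): duration doubles.
print(playback_duration_s(frames, 25))

# Every other frame timed at 25 fps: duration matches again.
print(playback_duration_s(frames // 2, 25))
```

Under that assumption, passing all 50 frames would only give the right duration if the frame rate seen by the encoder were also 50.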

Lack of Ivy Bridge support in current VTune

Slide 9 of https://software.intel.com/sites/default/files/Using_Intel_VTune_Amplifi...

shows a Sandy Bridge/Ivy Bridge Analysis category of pre-configured profiles. However, with the XE2015 Update 2, there are categories for Sandy Bridge and Haswell, but none (besides General) which work for Ivy Bridge (see attached). Which version of VTune, if any, is recommended for Ivy Bridge Memory Access analysis on
