<?xml version="1.0" encoding="UTF-8"?>
<!-- Generated on Wed, 25 Nov 2009 02:43:29 -0800 -->
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <atom:link href="http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/feed" rel="self" type="application/rss+xml" />
    <title>Intel Software Network - <![CDATA[ Problems related to Intel Architecture Code Analyzer ]]> feed</title>
    <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759</link>
    <description></description>
    <language>en-us</language>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <div style="margin:0px;">
<div id="quote_reply" style="width: 100%; margin-top: 5px;">
<div style="margin-left:2px;margin-right:2px;">Quoting - <a href="/en-us/profile/439568">babysam</a></div>
<div style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"><em>Hello everyone!<br /><br />I have used the IACA for nearly a month and it is great!<br />It help me to solve many problems that affects the performance of my code...<br />(and it is simple to use as long as you have the source code in hand)<br /><br />However, I have noticed something strange...<br />Even though the analyzer has correctly identified the code for rcpss/rsqrtss, <br />they are treated as the same as the divss/sqrtss respectively.(As I know, <br />both of the approximation instructions should run faster than the accurate <br />ones. However, the analyzer shows they are blocking the divider port for 14<br /> cycles) Is it a bug or something else?<br /><br />Thank you for your attention!</em></div>
</div>
</div>
<br />Hi,<br /><br /> Thanks for the input on Intel(R) Architecture Code Analyzer.<br /><br />You are correct, the rcp and rsqrt instructions were misclassified as divider operations.<br />We are working on providing a fix for this issue.<br /><br />Tal<br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Wed, 19 Aug 2009 23:15:56 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <div style="margin:0px;"></div>
A fixed version is now available. (ver 1.0.2)<br /><br />Tal<br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Mon, 24 Aug 2009 12:08:53 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <br />[url=http://www.dvdcollects.com/products/Desperate-Housewives-Season-1-5-DVD-Boxset-DVDS-1698.html]Desperate Housewives Season 1-5 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/24-Hours-Seasons-1-7-DVD-Boxset-DVDS-1684.html]24 Hours Seasons 1-7 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Ally-McBeal-Seasons-1-5-DVD-Boxset-DVDS-1264.html]Ally McBeal Seasons 1-5 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/The-King-of-Queens-Seasons-1-9-DVD-Boxset-DVDS-1744.html]The King of Queens Seasons 1-9 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Xena-Warrior-Princess-Season-1-6-DVD-Boxset-DVDS-1418.html]Xena: Warrior Princess Season 1-6 DVD Boxset[/url]<br /><br />[url=http://www.dvdcollects.com/products/30-Rock-Seasons-1-3-DVD-Boxset-DVDS-1199.html]30 Rock Seasons 1-3 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Angel-Seasons-1-5-DVD-Boxset-DVDS-1265.html] Angel Seasons 1-5 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Buffy-The-Vampire-Slayer-Seasons-1-7-DVD-Boxset-DVDS-1769.html]Buffy The Vampire Slayer Seasons 1-7 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Mistresses-Seasons-1-2-DVD-Boxset-DVDS-1768.html]Mistresses Seasons 1-2 DVD Boxset [/url]<br />[url=http://www.dvdcollects.com/products/Samantha-Who-Seasons-1-2-DVD-Boxset-DVDS-1762.html]Samantha Who Seasons 1-2 DVD Boxset[/url]<br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Wed, 02 Sep 2009 01:27:37 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <br />[url=http://www.dvdcollects.com/products/Desperate-Housewives-Season-1-5-DVD-Boxset-DVDS-1698.html]Desperate Housewives Season 1-5 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/24-Hours-Seasons-1-7-DVD-Boxset-DVDS-1684.html]24 Hours Seasons 1-7 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Ally-McBeal-Seasons-1-5-DVD-Boxset-DVDS-1264.html]Ally McBeal Seasons 1-5 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/The-King-of-Queens-Seasons-1-9-DVD-Boxset-DVDS-1744.html]The King of Queens Seasons 1-9 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Xena-Warrior-Princess-Season-1-6-DVD-Boxset-DVDS-1418.html]Xena: Warrior Princess Season 1-6 DVD Boxset[/url]<br /><br />[url=http://www.dvdcollects.com/products/30-Rock-Seasons-1-3-DVD-Boxset-DVDS-1199.html]30 Rock Seasons 1-3 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Angel-Seasons-1-5-DVD-Boxset-DVDS-1265.html] Angel Seasons 1-5 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Buffy-The-Vampire-Slayer-Seasons-1-7-DVD-Boxset-DVDS-1769.html]Buffy The Vampire Slayer Seasons 1-7 DVD Boxset[/url]<br />[url=http://www.dvdcollects.com/products/Mistresses-Seasons-1-2-DVD-Boxset-DVDS-1768.html]Mistresses Seasons 1-2 DVD Boxset [/url]<br />[url=http://www.dvdcollects.com/products/Samantha-Who-Seasons-1-2-DVD-Boxset-DVDS-1762.html]Samantha Who Seasons 1-2 DVD Boxset[/url]<br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Wed, 02 Sep 2009 01:27:43 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <div style="margin: 0px; height: auto;"></div>
Hello, I'm trying out the IACA 1.1 for Win32. <br />Is the following result a bug?
<pre name="code" class="plain">Intel(R) Architecture Code Analyzer Version - 1.1.0
Analyzed File - regrename.obj
Binary Format - 32Bit
Architecture  - Intel(R) AVX

Analysis Report
---------------
Total Throughput: 1 Cycles;             Throughput Bottleneck: FrontEnd, Port0
Total number of Uops bound to ports:  1
Data Dependency Latency:    1 Cycles;   Performance Latency:    1 Cycles

Port Binding in cycles:
-------------------------------------------------------
|  Port  |  0 - DV |  1 |  2 -  D |  3 -  D |  4 |  5 |
-------------------------------------------------------
| Cycles |  1 |  0 |  0 |  0 |  0 |  0 |  0 |  0 |  0 |
-------------------------------------------------------

N  - port number, DV - Divider pipe (on port 0), D - Data fetch pipe (on ports 2 and 3) 
CP - on a critical Data Dependency Path
N  - number of cycles port was bound
X  - other ports that can be used by this instructions
F  - Macro Fusion with the previous instruction occurred
^  - Micro Fusion happened
*  - instruction micro-ops not bound to a port
@  - Intel(R) AVX to Intel(R) SSE code switch, dozens of cycles penalty is expected
!  - instruction not supported, was not accounted in Analysis

| Num of |          Ports pressure in cycles          |    |
|  Uops  |  0 - DV |  1 |  2 -  D |  3 -  D |  4 |  5 |    |
------------------------------------------------------------
|   1    |  1 |    |  X |    |    |    |    |    |  X | CP | pxor xmm0, xmm1
|   0*   |    |    |    |    |    |    |    |    |    |    | pxor xmm2, xmm2
|   0*   |  X |    |  X |    |    |    |    |    |  X | CP | vpxor xmm3, xmm3, xmm4
|   0*   |    |    |    |    |    |    |    |    |    |    | vpxor xmm5, xmm5, xmm5
</pre>
About "<strong><em>vpxor xmm3, xmm3, xmm4</em></strong>" instruction, it seems must to be decoded to 1 uop. <br />So I think that the interpretation of the first operand and the third operand is contrary on the IACA. <br />Other some operations that destination register value absolutely becomes zero-all (ex: <em>xorps</em>, <em>xorpd</em>, <em>psub*</em>, <em>pcmpgt*</em>) are similar too. <br /><br />In addition, IACA will be segfault by the following code (only 0 uop instruction(s) between markers).
<pre name="code" class="plain">;START_MARKER
mov ebx, 111
db 0x64, 0x67, 0x90

vpxor xmm0, xmm0, xmm0 ;0 uop

;END_MARKER
mov ebx, 222
db 0x64, 0x67, 0x90
</pre> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Mon, 26 Oct 2009 06:46:18 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <div style="margin:0px;">
<div id="quote_reply" style="width: 100%; margin-top: 5px;">
<div style="margin-left:2px;margin-right:2px;">Quoting - <a href="/en-us/profile/449138">seizh</a></div>
<div style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"><em> Hello, I'm trying out the IACA 1.1 for Win32. <br />Is the following result a bug?
<pre name="code" class="plain">Intel(R) Architecture Code Analyzer Version - 1.1.0<br />Analyzed File - regrename.obj<br />Binary Format - 32Bit<br />Architecture  - Intel(R) AVX<br /><br />Analysis Report<br />---------------<br />Total Throughput: 1 Cycles;             Throughput Bottleneck: FrontEnd, Port0<br />Total number of Uops bound to ports:  1<br />Data Dependency Latency:    1 Cycles;   Performance Latency:    1 Cycles<br /><br />Port Binding in cycles:<br />-------------------------------------------------------<br />|  Port  |  0 - DV |  1 |  2 -  D |  3 -  D |  4 |  5 |<br />-------------------------------------------------------<br />| Cycles |  1 |  0 |  0 |  0 |  0 |  0 |  0 |  0 |  0 |<br />-------------------------------------------------------<br /><br />N  - port number, DV - Divider pipe (on port 0), D - Data fetch pipe (on ports 2 and 3) <br />CP - on a critical Data Dependency Path<br />N  - number of cycles port was bound<br />X  - other ports that can be used by this instructions<br />F  - Macro Fusion with the previous instruction occurred<br />^  - Micro Fusion happened<br />*  - instruction micro-ops not bound to a port<br />@  - Intel(R) AVX to Intel(R) SSE code switch, dozens of cycles penalty is expected<br />!  - instruction not supported, was not accounted in Analysis<br /><br />| Num of |          Ports pressure in cycles          |    |<br />|  Uops  |  0 - DV |  1 |  2 -  D |  3 -  D |  4 |  5 |    |<br />------------------------------------------------------------<br />|   1    |  1 |    |  X |    |    |    |    |    |  X | CP | pxor xmm0, xmm1<br />|   0*   |    |    |    |    |    |    |    |    |    |    | pxor xmm2, xmm2<br />|   0*   |  X |    |  X |    |    |    |    |    |  X | CP | vpxor xmm3, xmm3, xmm4<br />|   0*   |    |    |    |    |    |    |    |    |    |    | vpxor xmm5, xmm5, xmm5<br /></pre>
About "<strong><em>vpxor xmm3, xmm3, xmm4</em></strong>" instruction, it seems must to be decoded to 1 uop. <br />So I think that the interpretation of the first operand and the third operand is contrary on the IACA. <br />Other some operations that destination register value absolutely becomes zero-all (ex: <em>xorps</em>, <em>xorpd</em>, <em>psub*</em>, <em>pcmpgt*</em>) are similar too. <br /><br />In addition, IACA will be segfault by the following code (only 0 uop instruction(s) between markers).
<pre name="code" class="plain">;START_MARKER<br />mov ebx, 111<br />db 0x64, 0x67, 0x90<br /><br />vpxor xmm0, xmm0, xmm0 ;0 uop<br /><br />;END_MARKER<br />mov ebx, 222<br />db 0x64, 0x67, 0x90<br /></pre>
</em></div>
</div>
</div>
<br />Hi,<br /><br />Thank you for your input. The checking of the idiom was incorrect for the AVX version.<br /><br />I'm currently working on fixing this matter and other matters as well. I will update when a fix will be available on whatif.<br /><br />Tal <br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Mon, 26 Oct 2009 08:32:41 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <div style="margin:0px;">
<div id="quote_reply" style="width: 100%; margin-top: 5px;">
<div style="margin-left:2px;margin-right:2px;">Quoting - <a href="/en-us/profile/265147">Tal Uliel (Intel)</a></div>
<div style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"><em>
<div style="margin:0px;"></div>
<br />Hi,<br /><br />Thank you for your input. The checking of the idiom was incorrect for the AVX version.<br /><br />I'm currently working on fixing this matter and other matters as well. I will update when a fix will be available on whatif.<br /><br />Tal <br /></em></div>
</div>
</div>
thank you i will publish all...<br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Fri, 30 Oct 2009 08:59:02 -0700</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
    <item>
      <title>Re: Problems related to Intel Architecture Code Analyzer</title>
      <description><![CDATA[ <div style="margin:0px;">
<div id="quote_reply" style="width: 100%; margin-top: 5px;">
<div style="margin-left:2px;margin-right:2px;">Quoting - <a href="/en-us/profile/265147">Tal Uliel (Intel)</a></div>
<div style="background-color:#E5E5E5; padding:5px;border: 1px; border-style: inset;margin-left:2px;margin-right:2px;"><em>
<div style="margin:0px;"></div>
<br />Hi,<br /><br />Thank you for your input. The checking of the idiom was incorrect for the AVX version.<br /><br />I'm currently working on fixing this matter and other matters as well. I will update when a fix will be available on whatif.<br /><br />Tal <br /></em></div>
</div>
</div>
<br />A fixed version (1.1.1) was released today.<br /><br />Tal<br /> ]]></description>
      <link>http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</link>
      <pubDate>Thu, 05 Nov 2009 11:04:14 -0800</pubDate>
      <guid isPermaLink="true">http://software.intel.com/en-us/forums/whatif-alpha-software/topic/67759/</guid>
      <category>Parallel Programming</category>
      <category>ISN General</category>
    </item>
  </channel></rss>