Intel® AVX and CPU Instructions
Instruction decoder
I'm working at the moment with decoding instructions for a given executable (windows or linux) or a part of memory. It can be described as a simple disassembler. Your manuals on Pentium processors are great help but I miss some of the most basic aspects of the processor architecture. (currently I've been using Intel 64 and IA-32 Architectures Software Developer's Manual Volume 1, Basic Architecture and Intel 64 and IA-32 Architectures Software Developer's Manual Volume 2A, Instruction Set Reference)
Instruction decoder
I'm working at the moment with decoding instructions for a given executable (windows or linux) or a part of memory. It can be described as a simple disassembler. Your manuals on Pentium processors are great help but I miss some of the most basic aspects of the processor architecture. (currently I've been using Intel 64 and IA-32 Architectures Software Developer's Manual Volume 1, Basic Architecture and Intel 64 and IA-32 Architectures Software Developer's Manual Volume 2A, Instruction Set Reference)
reverse a PMOVMSKB instruction?
I've been searching for days for the reverse of the PMOVMSKB instruction.
I want to collapse a 64bit result down to 8bits and then restore it again.
for example
0xFF00FF00FF00FF00 = (PMOVMSKB) 10101010b = 0xFF00FF00FF00FF00
Can anyone help. If I had hair I'd be pulling it out :-)
REAL(16) - long double
Are there any plans to implement the IEEE X_floating point (16 byte) with SSEn.m instructions?
Although this might not be suitable for mainstream (desktop) computing, it would be suitable for premium priced products. Intel does have premium priced product lines e.g. mmm Extreme. You could lable it at mmm Extreme+ and double the price.
Jim Dempsey
Documentation suggestion
This is a suggestion for the documentation relating to SSEn.m
1) Can you add a section that seperates the instructions by type of data. This section would essentially be an index and hyperlinks back to the alphabetical listing. I find it cumbersome to weed through the alphabetical listing.
SSEn.m Instructions by data type
Integer Byte
Integer Word
Integer Double Word
Integer Quad Word
Float
...
Double
...
IA32 ISA
Opcode,ISA Encode,Decode
Performance difference between 32bit and 64bit memcpy
We have Core2 machines (Dell T5400) with XP64.
We observe that when running 32-bit processes, the throughput of memcpy is on the order of 1.2GByte/s; however memcpy in a 64-bit process achieves about 2.2GByte/s (or in fact 2.4GByte/s with the Intel compiler CRT's memcpy).
assembly program in itanium based systems
Hi,
I'veHP-UX xxxxxxxx B.11.23 U ia64 in itanium processor. I want to write and analyse assembly program in itanium system. I can get the program by generating the assembly code using gcc -S option from C source code. I am not getting the meaning of the instruction as well as assembly program flow.If I get assembly code list, meaning of that perticular code as well as the role of that code in the program, it will be a very helpful to me. thank youWhy "subq" as allocate by ICC-v10.0 but not as prologue, but ICC-v11.0 uses "pushq" as prologue?
Hi All.
Below is a piece of CPP code and behaviour of asm due to ICC-v11.0 & ICC-v10.0 -
--
#include
#include
#define MAX 1024
int main()
{
int i, j;
int num[MAX], isort[MAX], cluster[MAX][MAX];
for (j = 0; j < MAX; j++) {
num[j] = 0;
isort[j] = j;
for (i = 0; i < MAX; i++) {
cluster[j][i] = 0;
}
}
printf("%d %d %d\n",num[64],isort[78],cluster[384][74]);
return 0;
}
--
