Intel® Developer Zone:
Desempenho

Destaques

Recentemente publicado! Intel® Xeon Phi™ Coprocessor High Performance Programming 
Aprenda os conceitos básicos da programação para essa nova arquitetura e novos produtos. Novo!
Intel® System Studio
O Intel® System Studio e uma abrangente suíte de ferramentas integradas de desenvolvimento de software que pode diminuir o tempo de lançamento do produto no mercado, aumentar a confiabilidade do sistema e melhorar a eficiência energética e o desempenho. Novo!
No caso de você ter perdido - Repetição do webinar de dois dias
Introdução ao desenvolvimento de aplicativos de alto desempenho para os coprocessadores Intel® Xeon & Intel® Xeon Phi™.
Structured Parallel Programming
Os autores Michael McCool, Arch D. Robison e James Reinders usam uma abordagem baseada em padrões estruturados que podem tornar o assunto acessível a qualquer desenvolvedor de software.

Forneça aos clientes o melhor desempenho de seus aplicativos com a programação paralela e a ajuda dos inovadores recursos da Intel.

Recursos de desenvolvimento


Ferramentas de desenvolvimento

 

Intel® Parallel Studio

Trazendo um paralelismo simplificado e completo para os desenvolvedores do Microsoft Visual Studio* C/C++, o Intel® Parallel Studio tem ferramentas avançadas para otimizar aplicativos cliente para multi-core e muitos cores (núcleos).

Produtos Intel® para desenvolvimento de software

Explore todas as ferramentas para ajudar você a otimizar na arquitetura Intel. Ferramentas selecionadas estão disponíveis por um período de avaliação gratuita de 45 dias.

Base de conhecimento das ferramentas

Guias e informações de suporte para as ferramentas Intel.

Understanding How General Exploration Works in Intel® VTune™ Amplifier XE
Por Jackson Marusarz (Intel)Publicado em 02/09/20150
The General Exploration Analysis Type in Intel® VTune™ Amplifier XE is used to detect microarchitectural hardware bottlenecks in an application or system. General Exploration uses hardware event counters to detect and locate issues and presents the data in a user-friendly and actionable format. T...
The Generic Address Space in OpenCL™ 2.0
Por Adam Lake (Intel)Publicado em 02/06/20150
Introduction What is the Generic Address Space? Enabling the Generic Address Space Why Would I Want to Use the Generic Address Space? Performing Some Operations in a Specific Address Space Address Space Casting Performance Implications and How to Address Them A Working Example Future Work...
Testimonials
Por Harish Patil (Intel)Publicado em 02/05/20150
I. From Rice University Prof. John Mellor-Crummey's talk at Supercomputing 2014 (see PinPlay description on slides 6-8):Introduction to Correctness and Performance Tools for Parallel Programming. The International Conference for High Performance Computing, Networking, Storage and Analysis (SC14)...
DrDebug : Linux Command Line Usage
Por Harish Patil (Intel)Publicado em 02/05/20150
Using DrDebug requires following two phases1. recording and 2. replaying. Pre-requisites Setup  Recording With GDB From command line (without GDB)  Replaying With GDB From command line (without GDB)  Pre-requisites GDB version 7.4 or higher with Python support PinPlay/DrDebu...
Assine o Artigos do Espaço do desenvolvedor Intel
Nenhum conteúdo foi encontrado
Assine o Blogs do Intel® Developer Zone
Linking against both the sequential and threaded mkl
Por Mark Thomas5
I have two dlls that link against the static mkl libraries.  One of the dlls links against the sequential version and the other against the multi-threaded version.  Those two dlls are then loaded in to the same process.  Does anybody know whether this is safe to do please? Kind regards Mark
How to track down OpenMP segfault caused by the addition of ORDERED?
Por Alastair M.4
Dear all, I hope this is the right place to ask this question. I am working on adding OpenMP support to some existing Fortran code, using ifort version 15. I noticed that the addition of the c$OMP ORDERED clause to my outer parallel do loop causes the program to segfault in the second loop iteration, when attempting to access a FIRSTPRIVATE variable.  This occurs with OMP_NUM_THREADS=1.  The same error also occurs with ifort 14.0.2. On further inspection I realised that at some point during the 2nd loop iteration the stack becomes corrupted.  That is, "info locals" in gdb complains about not being able to read certain variables, when it previously could, and then the segfault follows shortly afterwards.  I also noticed that the location of the segfault is repeatable but changes when the list of FIRSTPRIVATE variables is changed. With the ORDERED construct removed from the loop, the program executes correctly and tests with valgrind and inspxe indicate zero problems.  I have ulimit -...
Where can i download Intel MPI Benchmarks?
Por Bo W.1
Hello everyone, where can i  download the intel mpi benchmarks? Cheers, Bo
'Wildhoney' - the 512bit superfast textual decompressor - some thoughts
Por Georgi M.19
Hi to all. Glad I am that finally joined the Intel forum, long overdue. Here I want to share my amateurish vision on superfast textual decompression topic. For 4 months now I have been playing with my file-to-file decompressor named Nakamichi. I am on quest for writing the fastest possible variant of my approach, branchlessness combined with one only native (hifhest order) register on latest machines. This translates to 64bit/512bit mixed code. Few hours ago I wrote 'Wildhoney' variant using just that configuration. And two important things: - Nakamichi is 100% FREE - no restictions at all for modifying as the original Lempel-Ziv was; - Speed is religion, the fastestness is the ultimate goal. So far, I have written two OpenMP console tools, each enforcing 16 threads - MokujIN and Kazahana, I hope Nakamichi 'Wildhoney' to be the third. Any help in developing it I would appreciate, many basic still things I don't know. The ZMM executable with the C source is here:http://www.san...
need something like a sorted tbb::parallel_do
Por foelsche@sbcglobal.net1
    from what I see there is tbb::concurrent_priority_queue.         but with this I would have to deal with thread pools myself.       is this really true?
TBB: Using task_scheduler_observer to set worker thread's OS scheduling priority
Por Tim Day5
I'm looking at TBB's task_arena and task_scheduler_observer. The documentation for task_scheduler_observer sketches out a nice example of it being used to set thread affinity on worker threads to lock an arena's threads onto a subset of cores. I'm curious to know whether this class and a similar pattern could practically be used to set OS scheduling priority for an arena.  What I'm interested in doing is, on my N core HW, creating an arena with N normal worker threads, and another arena with N threads on a lower OS scheduling priority.  However, the issue with scheduler priority is that generally you only get to lower it (unless running as root, but assume not), and it's not clear to me to what extent TBB worker threads move around between arenas (which would defeat the object of keeping all the low priority threads in one arena); the task_scheduler_observer docs mention returning false from on_scheduler_leaving() to keep a thread in an arena... but also mentions the possibility of ...
API for Haswells TSX
Por roberto c.2
hello, i have just begun my research focus with HTM, primarily focusing on RTM(restricted transaction memory). is there any APIs for RTM? I have looked on the internet but only the basic operands exist for RTM, such as xbegin, xend, xabort, xtest. I want to be able to access the shared memories with HTM but i can not find any library files for it.  Can you please point me in the right direction, thanks for your support.
CL_DEVICE_TYPE_CPU not working in Windows 8.1
Por Yaknan G.1
Hi, I recently tried to run my OpenCL program on a new windows 8.1 computer but the program returns an error when the device type is CL_DEVICE_TYPE_CPU. When I change the device type to a CL_DEVICE_TYPE_GPU or CL_DEVICE_TYPE_ ALL it ran the program on the GPU. Here is the system specification of the new computer: OS: Windows 8.1 Processor: Intel Core i7 - 4700MQ clocked at 2.40GHz Display Adapter: Intel HD Graphic 4600 and NVIDIA GeForce GT 740M How can I resolve this problem and is OpenCL having issues with windows 8.1? Please help! Yaknan
Assine o Fóruns

Destaques