Performance tuning of an existing application is truly a challenge and it depends on a lot of factors like the nature of algorithm the application works on, if the implementation is scalable to take advantage of thread/data parallelism etc. The most logical approach any developer would follow for tuning the performance of an application is to do a dynamic profiling of the application under different workloads, try to analyze the hotspots in that application, and then fine tune them to work best on a given hardware architecture.
Intel® Advanced Vector Extensions 512 (Intel® AVX-512)
The figures, tables, drawings, etc. used in our book can be downloaded from the book's website. We appreciate attribution, but there are no restrictions on use in educational material (presentations)!
Suggestion attribution: (c) 2013 Jim Jeffers and James Reinders, used with permission.
This paper focuses on optimizing the design and deployment of Web-tier services using Intel® based servers, communications, and storage solutions to achieve the best balance of high-performance, low power consumption, high-availability, affordability and low Total Cost of Ownership (TCO).
Traditional Business Intelligence (BI) infrastructure cannot handle today’s massive, diverse, and fast-moving data streams. In this brief, learn how Apache Hadoop* running on Intel® Architecture provides a foundation you can implement today for targeted value, and then expand almost without limit to meet growing needs.
This two day webinar series introduces you to the world of multicore and manycore computing with Intel® Xeon® processors and Intel® Xeon Phi™ coprocessors. Expert technical teams at Intel discuss development tools, programming models, vectorization, and execution models that will get your development efforts powered up to get the best out of your applications and platforms.