|
Selected Papers: (for complete list please
check my CV ) [Book] Mohamed Zahran, Heterogeneous Computing: Hardware and Software Perspectives, ACM Books, 2019 [ISBN: 9781450362337 | PDF ISBN: 9781450361002 [46] D. Tantawy, M. Zahran and A. G. Wassal, PTcomp: Post-Training Compression Technique for Generative Adversarial Networks, in IEEE Access, vol. 11, pp. 9763-9774, 2023. [45] Dina Tantawy, Mohamed Zahran, Amr Wassal, A Survey on GAN Acceleration Using Memory Compression Technique, Journal of Engineering and Applied Science, [44] Nick Greenquist, Doruk Kilitcioglu, Mohamed Zahran and Anasse Bari, GPU Accelerated Matrix Factorization for Recommender Systems, the 6th IEEE International Conference on Big Data Analytics (ICBDA 2021), March 2021. (Best Presentation Award) [43] Antonio Mallia, Michał Siedlaczek, Torsten Suel, and Mohamed Zahran ,GPU-Accelerated Decoding of Integer Lists, in The 28th ACM International Conference on Information and Knowledge Management (CIKM), Beijing, China, November 2019. [42] Tulsi Jain, Nitish Agarwal, and Mohamed Zahran, Performance Prediction for Multi-threaded Applications in The 2nd International Workshop on AI-assisted Design for Architecture (AIDArc), held in conjunction with the International Symposium on Computer Architecture (ISCA), June 2019. [41] Mohamed Zahran and Marsha Berger, "Parallel Computing At The Undergraduate Level: Lessons Learned and Insights", in Workshop on Computer Architecture Education Held in conjunction with 46th International Symposium on Computer Architecture (ISCA), June 2019. [40] Mahmoud Khairy, Amr Wassal, and Mohamed Zahran, A survey of architectural approaches for improving GPGPU performance, programmability and heterogeneity, Elsevier Journal of Parallel and Distributed Computing, Volume 127, May 2019, Pages 65-88. [39] Chris Quackenbush and Mohamed Zahran, Beyond Profiling, in The 1st International Workshop on AI-assisted Design for Architecture (AIDArc), [38] Chris Quackenbush and Mohamed Zahran, Beyond Profiling: Scaling Profiling Data Usage to Multiple Applications, arXiv:1711.01654 , 2017. [37] Mahmoud Khairy, Mohamed Zahran, and Amr Wassal, SACAT: Streaming-Aware Conflict-Avoiding Thrashing-Resistant GPGPU Cache Management Scheme, IEEE Transactions on Parallel and Distributed Systems, vol 28, issue 6, June 2017. [36] Numair Khan and Mohamed Zahran, Space-efficient Pointwise Computation of the Distance Transform on GPUs, in 7th IEEE Workshop Parallel / Distributed Computing and Optimization [35] Chris Rohlfs and Mohamed Zahran, Optimal Bandwidth Selection for Kernel Regression Using a Fast Grid Search and a GPU, in 7th IEEE Workshop Parallel / Distributed Computing and Optimization (PDCO 2017), in conjunction with 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS), May 2017. [34] Mohamed Zahran, Heterogeneous Computing: Here to Stay, ACM Queue, vol 14, No. 6, Nov/Dec 2016, and Communications of the ACM, March 2017. [33] Mohamed Zahran, Brain-Inspired Machines: What, Exactly, Are We Looking for?, IEEE Pulse, Mar 2016. [32] Mahmoud Khairy, Mohamed Zahran, and Amr G. Wassal, Efficient utilization of GPGPU cache hierarchy, in the 8th Workshop on General-purpose processing using [31] M. Zahran, Multicore processors: Status quo and future directions, in 10th International Computer Engineering Conference (ICENCO), Dec 2014 (Invited Paper) (pdf). [30] J. Rajendran, A. K. Kanuparthi, M. Zahran, S. Addepalli,
G. Ormazabal, and R. Karri, Securing processors against insider attacks:
a circuit-microarchitecture co-design approach, IEEE Design
and Test of C2mputers, Vol 30, issue 2, Mar/Apr, 2013 [28] H. Chtioui, S. Niar Lamih, R. Ben-Atitallah, M. Zahran, Jl. Dekeyser, andM. Abid, A Dynamic Hybrid Cache Coherency Protocol for Shared-Memory MPSoC Architectures, [27] Corey Malone, Mohamed Zahran, and Ramesh Karri, Are Hardware Performance Counters a Cost Effective Way for Integrity Checking of Programs?, The Sixth ACM Workshop on Scalable Trusted Computing, October 2011. (pdf) [26] Mohamed Salah Souahi, Smail Niar, Mohamed Zahran, Mohamed Benmohamed, Towards Dynamic Cache Block Placement for Multi-processor NUCA, IEEE International Conference on Microelectronics, December 2011. [25] Artem Durytskyy, Mohamed Zahran, and Ramesh Karri, Improving Robustness of GPUs by Making Use of Faulty Parts, Proc. International Conference on Computer Design (ICCD11), October 2011. (pdf) [24] Arun K. Kanuparthi, Mohamed Zahran, and Ramesh Karri, Feasibility Study of Dynamic Trusted Platform Module, Proc. International Conference on Computer Design (ICCD10), [23] Ahmed Youssef, Mohamed Zahran, Mohab Anis, and Mohamed Elmasry,
On the Power Management of Simultaneous
Multithreading Processors, IEEE Transactions on VLSI , [22] Mohamed Zahran and Sally A. McKee, Global Management of Cache Hierarchies , The ACM International Conference on Computing Frontiers (CF'10), Italy, May 2010. (pdf) [21] Yufu Zhang , Ankur Srivastava and Mohamed Zahran, On-Chip Sensor Driven Efficient Thermal Profile Estimation Algorithms, ACM Transactions on Design Automation of Electronic Systems, Vol 15, issue 3, May 2010.[20] Kim Hazelwood and Mohamed Zahran. Challenges and Opportunities at All Levels: Interactions Among Operating Systems, Compilers, and Multicore Processors, ACM SIGOPS Operating System Review. Volume 43, Issue 2. April 2009.[19] Najla Alfaraj,
H. Jonathan Chao, and Mohamed Zahran, NBC: Network-based
Cache Coherence Protocol for Multistage NoCs, in The International SoC Design Conference
(ISOCC), 2009.
[18] Bushra Ahsan and Mohamed Zahran, Managing
Off-Chip Bandwidth: A Case for Bandwidth-Friendly Replacement Policy, in The 2nd
Workshop on Managed Multi-Core Systems (MMCS'09), held in
conjunction
with ASPLOS 2009. (pdf)
[17]
Mohamed Zahran and Sally A. McKee, Adaptive Block Placement
Policy for Cache Hierarchies,in SMART'09:3rd Workshop on Statistical
and Machine learning approaches to ARchitectures and compilaTion, held
in conjunction with HiPEAC 2009. (pdf) [16] Bushra
Ahsan and Mohamed Zahran, Cache Performance, System Performance,
and Off-Chip Bandwidth... Pick any Two
, in 3rd workshop Interconnection Network
Architectures: On-Chip, Multi-Chip (INA-OCMC), held in
conjunction with HiPEAC 2009. (pdf)
[12] Mohamed
Zahran, Kursad Albayraktaroglu, and Manoj Franklin,
Non-Inclusion Property in multi-level Caches Revisited, in the International
Journal of Computers and Their Applications Special Issue on
Techniques and Architectures for High Performance and Energy
Efficient Computing Systems, Vol 14, Num 2, June 2007. ( bib,
pdf) [10] Mohamed
Zahran and Anasua Bhowmik, Bandwidth-Friendly Cache Hierarchy, in The 2006 International Conference on Computer
Design (CDES06), Las Vegas, 2006. (bib, pdf) [9] Mohamed
Zahran and Anasua Bhowmik, Hybrid Compiler and Microarchitecture Technique for Cache Traffic Optimization, in
9th Workshop on Interaction between Compilers and Computer
Architectures (INTERACT 9), held in Conjunction with the 11th
International Symposium
on High-Performance Computer Architecture (HPCA-11), 2005. (bib, pdf) [8] Francois Cantonnet, Yiyi Yao, Mohamed Zahran and Tarek El-Ghazawi, Productivity Analysis of the UPC Language, in 3rd International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS), to be held in conjunction with the International Parallel and Distributed Processing Symposium (IPDPS 2004). [7] Mohamed
Zahran and Manoj Franklin, Dynamic Thread Resizing for Speculative Multithreaded
Processors, in International Conference
on Computer
Design (ICCD), San Jose, CA, October, 2003. (pdf) (Best Paper Award)
[6] Mohamed
Zahran, Manoj Franklin and Renju Thomas, Confidence Estimation
for Register Value
Communication in Speculative Multithreaded
Architectures, in first value prediction workshop
(VPW1), held in conjunction with the 30th
Annual International Symposium on Computer
Architecture (ISCA), San Diego, California, 2003. (pdf) [5] Mohamed
Zahran, On Cache Memory Hierarchy for Chip-Multiprocessor, in MEDEA workshop held in conjunction
with PACT 2002 Conference, Charlottesville, Virginia, 2002. Also
Appeared in ACM Computer Architecture News, Vol 31, No. 1,
March 2003. [4] Mohamed
Zahran and Manoj Franklin, Return Address Prediction in
Speculative Multithreaded Environments, in Int'l Conference on
Hi-Performance Computing, Bangalore, India, 2002. (pdf) [3] Mohamed
Zahran and Manoj Franklin, A Feasibility Study of Hierarchical
Multithreading, in International
Parallel and Distributed Processing Symposium (IPDPS 2002),
Marriott Marina, Fort Lauderdale, Florida, 2002. (pdf) [2] Mohamed
Zahran and Manoj Franklin, Hierarchical Multi-threading
For Exploiting Parallelism
at Multiple Granularities, Workshop on
MULTITHREADED EXECUTION, ARCHITECTURE and COMPILATION (MTEAC-5), Austin, Texas, 2001.
(pdf) [1]
Mohamed Zahran, Ashraf Abdel-Wahab and Samir Shaheen, Adaptive
Genetic Algorithm
for Multiprocessor Scheduling, poster
presentation at the Genetic and Evolutionary Computation Conference (GECCO),
Orlando, 1999. Selected Presentations & Talks (for complete list
please check my CV
): [2] Architecture Support for Big Data, Bloomberg, November 2016. [3] Panel at IBM Research Workshop on Architectures for Cognitive Computing and Datacenters, IBM T. J. Watson lab , October 2016. [4] Heterogeneous Computing: Hardware and Software Perspective, ACM Applicative, June 2016. [5] "Off-Chip Bandwidth: The New Wall in The Multicore Era", in CS Departmental seminar series, University of Delaware, March 2009. [6 ] "Attacking The Von-Neumann Bottleneck: Smart and Scalable Cache Hierarchy in The Chip Multiprocessor Era", IBM T. J. Watson, Feb 2007. |