Publications

(2020). Semantic Nonnegative Matrix Factorization with Automatic Model Determination for Topic Modeling. In: Proceedings of the 19th {IEEE} International Conference on Machine Learning and Applications, {ICMLA}, IEEE, pp. 328–335.

(2020). NVIDIA GPGPUs Instructions Energy Consumption. In Proceedings of the International Symposium on Performance Analysis of Systems and Software, {ISPASS}, 2020, IEEE, pp. 110–112.

(2020). Distributed Non-Negative Tensor Train Decomposition. In: Proceedings of the High Performance Extreme Computing Conference, {HPEC}, IEEE, pp. 1–10.

(2020). An Out of Memory tSVD for Big-Data Factorization. In: IEEE Access, IEEE, 2020, vol. 8 pp. 107749–107759.

(2020). Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs. In Proceedings of the 17th ACM International Conference on Computing Frontiers, ACM, pp. 60–70.

(2020). PPT-SASMM: Scalable Analytical Shared Memory Model: Predicting the Performance of Multicore Caches from a Single-Threaded Execution Trace. In Proceedings of the International Symposium on Memory Systems, {MEMSYS}, 2020, ACM, pp. 341–351.

(2020). Fast, Accurate, and Scalable Memory Modeling of GPGPUs using Reuse Profiles. In Press: The 34th ACM International Conference on Supercomputing (ICS), 2020, ACM, pp. xx–xx.

(2020). Distributed non-negative matrix factorization with determination of the number of latent features. In: The Journal of Supercomputing, Springer, 2020, 76(9) pp. 7458–7488.

(2020). Decoy Selection in Protein Structure Determination via Symmetric Non-negative Matrix Factorization. In: Proceedings of the International Conference on Bioinformatics and Biomedicine, {BIBM}, IEEE, pp. 23–28.

(2020). Decoy Selection for Protein Structure Prediction Via Extreme Gradient Boosting and Ranking . In BMC Bioinformatics, pp. 189.

(2020). Code Characterization With Graph Convolutions and Capsule Networks. {IEEE} Access, vol. 8, 2020, IEEE, pp. 136307–136315.

(2019). Combating Label Noise in Deep Learning Using Abstention. In: International Conference on Machine Learning (ICML) pp. 6234–6243.

Preprint

(2019). {POSTER:} GPUs Pipeline Latency Analysis. In: 30th {IEEE} International Conference on Application-specific Systems, Architectures and Processors, {ASAP}, IEEE, pp. 139.

(2019). Unsupervised Learning for Decoy Selection in Protein Structure Prediction. Biophysical Journal.

(2019). Scalable Code Performance Prediction of Codes with Memory Hierarchy and Pipelines. In: ACM SIGSIM International Conference on Principles of Advanced Discrete Simulation (SIGSIM-PADS), ACM, pp. 13–24.

(2019). PPT-GPU: Scalable GPU Performance Modeling. In: IEEE Computer Architecture Letters, (18), 1, pp. 55–58.

DOI

(2019). Performance Prediction of loop parallel codes on multi-cores using shared memory models. In: Computer Architecture Letters.

(2019). On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks. In: arXiv preprint arXiv:1905.11001.

(2019). On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks. In: dvances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems, (NeurIPS) pp. 13888–13899.

(2019). Non-Negative Matrix Factorization for Selection of Near-Native Protein Tertiary Structures. In: International Conference on Bioinformatics and Biomedicine, {BIBM}, IEEE, pp. 70–73.

(2019). Multi-task Learning to Extract Tumor Genomic Markers from Breast Cancer Pathology Reports and Uncertainty Quantification. In: Journal of American Medical Informatics Association (JAMIA).

(2019). Knows When it Doesn’t Know: Deep Abstaining Classifiers. In: International Conference on Learning Representations (ICLR).

(2019). Instructions' Latencies Characterization for NVIDIA GPGPUs. In: IEEE HPEC, 2019 pp. 1–8_.

(2019). GPUs Cache Performance Estimation using Reuse Distance Analysis. In: 38th {IEEE} International Performance Computing and Communications Conference, {IPCCC}, IEEE, pp. 1–8.

(2019). Extended Parallel Seed-based Approach with Machine Learning-based Scoring Function and Graph Models for Protein Structure Prediction. In: Journal of Biomedical Informatics.

(2019). Decoy Selection for Protein Structure Prediction via Extreme Gradient Boosting and Ranking. In: BMC Bioinformatics.

(2018). Synthesis of Parallel Programs on Multi-Cores. In: Handbook on Grammatical Evolution, Springer, pp. 289–315.

(2018). Quantum Algorithm Implementations for Beginners. In: arXiv preprint arXiv:1804.03719.

(2018). PyPaast: Parallel Application Performance Prediction Using Memory Models and HPC Simulations. In: Transactions on Modeling and Computer Simulation (TOMACS).

(2018). PPT-GPU: Performance Prediction Toolkit for GPUs: Identifying the impact of caches. In: International Symposium on Memory Systems (MEMSYS), pp. 301–302.

(2018). Parallel Application Performance Prediction Using Analysis Based Models and HPC Simulations. In: Proceedings of the 2018 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, ACM, pp. 49–59.

(2018). Improved Decoy Selection via Machine Learning and Ranking. In: Proceedings of the 8th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), pp. 1–1.

(2018). IMCSIM: Parameterized performance prediction for implicit monte carlo codes. In: Winter Simulation Conference (WSC), pp. 491–502.

(2017). Probabilistic Monte Carlo simulations for static branch prediction. In: IEEE International Performance Computing and Communications Conference (IPCCC), pp. 1–4.

(2017). Performance prediction toolkit.

(2017). An analytical memory hierarchy model for performance prediction. In: Winter Simulation Conference (WSC), pp. 908–919.

(2017). AMM: scalable memory reuse model to predict the performance of physics codes. In: IEEE International Conference on Cluster Computing (CLUSTER), pp. 649–650.

(2017). A scalable analytical memory model for cpu performance prediction. In: International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, pp. 114–135.

(2017). A probabilistic monte carlo framework for branch prediction. In: IEEE International Conference on Cluster Computing (CLUSTER), pp. 651–652.

(2016). Automatic lock-free parallel programming on multi-core processors. In: IEEE Congress on Evolutionary Computation (CEC), pp. 4143–4150.

(2015). Synthesis of parallel iterative sorts with multi-core grammatical evolution. In: Genetic and evolutionary computation conference companion, pp. 1059–1066.

(2015). Performance optimization of multi-core grammatical evolution generated parallel recursive programs. In: Genetic and evolutionary computation conference, pp. 1007–1014.

(2015). On the automatic generation of efficient parallel iterative sorting algorithms. In: Genetic and evolutionary computation conference companion, pp. 1369–1370.

(2015). Automatic evolution of parallel sorting programs on multi-cores. In: European Conference on the Applications of Evolutionary Computation, pp. 706–717.

(2015). Automatic evolution of parallel recursive programs. In: European Conference on Genetic Programming, pp. 167–178.

(2014). Predict the success or failure of an evolutionary algorithm run. In: Genetic and evolutionary computation conference companion, pp. 131–132.

(2014). Predict the performance of GE with an ACO based machine learning algorithm. In: Genetic and evolutionary computation conference companion, pp. 1353–1360.

(2014). On The Efficiency of Multi-core Grammatical Evolution (MCGE) Evolving Multi-Core Pallel Programs. In: Sixth World Congress on Nature and Biologically Inspired Computing, pp. 238–243.

(2014). Multi-core GE: automatic evolution of CPU based multi-core parallel programs. In: Genetic and evolutionary computation conference companion, pp. 1041–1044.

(2014). Eant-miner: an ensemble ant-miner to improve the ACO classification. In: arXiv preprint arXiv:1409.2710.

(2013). An Empirical Analysis Through the Time Complexity of GE Problems. In: 19th International Conference on Soft Computing, MENDEL 2013, pp. 37–44.