
It's easy when you're doing things you love
I'm a Researcher | a Coder | a Hiker | a Photographer | an Amateur Penman

Hi, I'm Debjyoti Paul

My friends call me Deb

Currently a PhD student at the School of Computing, University of Utah
I earned my Master of Technology in Computer Science from IIT Kanpur

Research Interests:

Large Scale Social Media Data Analytics, Machine Learning and Data Mining techniques, Deep Learning, Bayesian Learning, Text Analytics, Indexing techniques, Data Visualization.

Office Address

Room 2780 WEB building

72 S. Central Campus Drive

School of Computing, University of Utah

Salt Lake City, UT 84112

Education


University of Utah

Doctorate

Computer Science

GPA: 3.93/4.0

Indian Institute of Technology Kanpur

Master of Technology

Computer Science & Engineering

GPA: 8.67/10.0 (Rank: 3)

Institute of Engineering & Management

Bachelor of Technology (Rank < 10)

Computer Science & Engineering

GPA: 8.93/10.0

Experience


Research Assistant (2015-2017)

Advisor: Prof. Feifei Li

University of Utah

Teaching Assistant (2016-2017)

Natural Language Processing, Data Mining

University of Utah

Software Developer (2013-2015)

Data Platform Team: rated "Exceeds Expectations" for 2014

Flipkart (flipkart.com)

Hackathon Awards


[2016] EMC2 {Code} Mars Challenge Hackathon

Winner

Out of 45 teams

[2015] Goldman Sachs Air Quality Hackathon

1st Runner-up

Out of 30 teams

[2014] InMobi Freedom Hack Worldwide Hackathon

1st Runner-up

Out of 160 teams

[2013] Yahoo HackU Hackathon

Winner

Out of 40 teams

Skills


Languages: Python, Java, C++, C
Databases: MySQL, HP Vertica, PostgreSQL
Scripts: JavaScript, Shell Script, D3.js, Three.js, LaTeX
Software: Vim, OhMyZsh, IntelliJ IDEA, Eclipse, WebStorm
Tidbits: Apache Spark, Data Warehousing, Hadoop, Azkaban2 & Oozie (execution engines), Maven

Publications

Debjyoti Paul, Sanjeev K. Aggarwal, Multi-objective Evolution based Dynamic Job Scheduler in Grid
The 8th International Conference on Complex, Intelligent, and Software Intensive Systems (CISIS 2014), July 2nd – 4th, 2014, Birmingham, UK.



Grid computing is a high-performance computing environment that fulfills large-scale computational demands. It can integrate computational as well as storage resources from different networks and geographically dispersed organizations into a high-performance computational and storage platform. It is used to solve complex, computation-intensive problems and also provides solutions for storage-intensive applications with connected storage resources. Scheduling user jobs properly on heterogeneous resources is an important task in a grid computing environment. The main goals of scheduling are to maximize resource utilization, minimize the waiting time of jobs, reduce energy consumption, and minimize cost to the user while satisfying the constraints of jobs and resources. We can trade off between the required level of quality of service, the deadline, and the budget of the user. In this paper, we propose a Multi-objective Evolution-based Dynamic Scheduler in Grid. Our scheduler uses a multi-objective optimization technique based on a genetic algorithm with a Pareto-front approach to find efficient schedules. It explores the search space widely to avoid stagnation and generates near-optimal solutions. We propose that our scheduler provides a better grip on most features of the grid from the perspective of the grid owner as well as the user. The dynamic grid environment has forced us to make it a real-time dynamic scheduler. A job grouping technique is proposed for grouping fine-grained jobs and for ease of computation. Experimentation on different data sets and with various parameters revealed the effectiveness of multi-objective scheduling criteria and the extraction of performance from grid resources.
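
The Pareto-front idea mentioned in the abstract can be illustrated with a small sketch. This is not the scheduler from the paper: it only shows how non-dominated candidates are selected when several objectives are minimized at once. The objective tuple (waiting time, energy, cost) and the sample values are made up for illustration, and the function names are hypothetical.

```python
from typing import List, Sequence, Tuple

# Assumption: every objective is to be minimized.
Objectives = Tuple[float, ...]


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b on every objective and strictly better on one."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Objectives]) -> List[Objectives]:
    """Return the points that no other point dominates, in input order."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical candidate schedules: (waiting time, energy, cost).
candidates = [
    (10.0, 5.0, 3.0),
    (8.0, 6.0, 3.0),
    (12.0, 7.0, 4.0),
    (8.0, 6.0, 2.5),
]
front = pareto_front(candidates)
print(front)
```

In a genetic algorithm, a filter like this would typically be applied to each generation so that selection favors schedules that are not beaten on every objective by another schedule.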

Manash Pal, Arnab Bhattacharya, Debjyoti Paul, RCached-tree: An Index Structure for Efficiently Answering Popular Queries
ACM International Conference on Information and Knowledge Management (CIKM 2013), Oct. 27–Nov. 1, 2013, San Francisco, CA, USA.



In many applications of similarity searching in databases, a set of similar queries appear more frequently. Since it is rare that a query point with its associated parameters (range or number of nearest neighbors) will repeat exactly, intelligent caching mechanisms are required to efficiently answer such queries. In addition, the performance of non-repeating and non-cached queries should not suffer too much either. In this paper, we propose RCached-tree, belonging to the family of R-trees, that aims to solve this problem. In every internal node of the tree up to a certain level, a portion of the space is reserved for storing popular queries and their solutions. For a new query that is encompassed by a cached query, this enables bypassing the traversal of lower levels of the subtree corresponding to the node as the answers can be obtained directly from the result set of the cached query. The structure adapts itself to varying query patterns; new popular queries replace the old cached ones that are not popular any more. Queries that are not popular as well as insertions, deletions and updates are handled in the same manner as in a general R-tree. Experiments show that the RCached-tree can outperform R-tree and other such structures by a significant margin when the proportion of popular queries is 20% or more by reserving 30-40% of the internal nodes as cache.
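
The "encompassed by a cached query" step can be sketched for range queries as follows. This is only an illustration under stated assumptions, not the RCached-tree itself: it assumes Euclidean range queries, keeps the cache in a flat list rather than inside R-tree nodes, and uses hypothetical names (`CachedRangeQuery`, `answer_from_cache`). A new query ball lies inside a cached ball when the distance between centers plus the new radius does not exceed the cached radius.

```python
import math
from typing import List, Optional, Tuple

Point = Tuple[float, ...]


class CachedRangeQuery:
    """A popular range query stored together with its result set."""

    def __init__(self, center: Point, radius: float, results: List[Point]):
        self.center = center
        self.radius = radius
        self.results = results

    def encompasses(self, center: Point, radius: float) -> bool:
        # The new ball is contained in the cached ball if
        # dist(centers) + new radius <= cached radius.
        return math.dist(self.center, center) + radius <= self.radius


def answer_from_cache(
    cache: List[CachedRangeQuery], center: Point, radius: float
) -> Optional[List[Point]]:
    """Answer a range query from the cache, or return None on a cache miss."""
    for cached in cache:
        if cached.encompasses(center, radius):
            # Every answer of the new query is already in the cached result set,
            # so filtering that set is enough; no subtree traversal is needed.
            return [p for p in cached.results if math.dist(center, p) <= radius]
    return None


# Made-up data: one cached query centered at the origin with radius 5.
cache = [
    CachedRangeQuery(
        (0.0, 0.0), 5.0, [(1.0, 1.0), (3.0, 0.0), (0.0, 4.0), (-4.0, 0.0)]
    )
]
hit = answer_from_cache(cache, (1.0, 0.0), 2.5)
miss = answer_from_cache(cache, (4.0, 0.0), 2.0)
print(hit, miss)
```

On a miss, the query would fall back to the usual R-tree search, which matches the abstract's statement that unpopular queries are handled as in a general R-tree.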

Debjyoti Paul, Sumana Basu, Sukanya Ghosh, Lightweight Security Enhancement Protocol for Radio Frequency Identification (RFID)
Proceedings of International Conference on Scientific Paradigm Shift In Information Technology & Management (SPSITM 2011), January 2011, Kolkata, India.



Although RFID provides automatic object identification, it is vulnerable to various security threats that put consumer and organization privacy at stake. In this work, we consider some existing security protocols for RFID systems and analyze the possible security threats at each level. We modify the parts of the protocols that have security loopholes and propose a modified four-level security model that has the potential to protect against these threats.

Debjyoti Paul, Sumana Basu, Punit Beriwal, Multilevel Security Protocol using Radio Frequency Identification
IEEE Paper, International Conference on Emerging Trends in Mathematics and Computer Applications 2010, pp. 544–547, Sivakasi, Tamil Nadu.



Projects

Twitter Election 2016 Sentiment Analysis, What Twitter says!

MusicAtlas - Music Worldwide!


Question Answering System

Online Topic Discovery via Online Clustering


AirQuality @ Utah


Metonym: Learn vocabulary with Wordweb interactively

IntelliAd: A Social Media driven Intelligent Ad-Targeting framework using Geo-profiling

Dart News: A street news browsing application with an interactive GIS interface


Multi-objective Evolution based Dynamic Job Scheduler in Grid


RCached-tree: An Index Structure for Efficiently Answering Popular Queries


STaCHIT: Smart TimeLine and Chit-Chat (Yahoo HackU Winner 2012)

Real-time discrimination of Speech and Music

Cryptography: Diffie-Hellman Key Exchange Through Elliptic Curve Method

Achievements

The spatio-temporal sentiment analysis project estorm.org analyzed public sentiment on the US election and gained a lot of media coverage. (2016)
Building a National Neighborhood Dataset From Geotagged Twitter Data for Indicators of Happiness, Diet, and Physical Activity. JMIR 2016. Attracted a lot of media attention.
Ranked 3rd out of 39 M.Tech students in the CSE department at the Indian Institute of Technology Kanpur. (2011-2013)
Secured All India Rank 228 in GATE 2012 among 1.56 lakh participants in Computer Science & Information Technology. (2011-2012)
Achieved All India Rank 7 in the Indian Space Research Organization (ISRO) recruitment exam. (2011-2012)
Secured All India Rank 223 in GATE 2011 among 1.36 lakh participants in Computer Science & Information Technology. (2010-2011)
Among the top 10 students of the CSE department at the Institute of Engineering & Management, Kolkata, and awarded for academic excellence in B.Tech. (2007-2011)
“Boosting performance of popular queries”, done as part of the CS618 (Indexing and Searching of Databases) course, was named Best Project by the course professor. (2011-2012)
Ranked 2nd in the Project Fair at Bits to Bytes 2008 for the Most Economic Autonomous Line Follower Robot, and placed 3rd in the Roborally event at Dzyan IEM Techfest 2008.
Awarded 2nd prize for academic excellence in school in Class XII
Awarded 1st prize for academic excellence in school in Class XI
Awarded a district-level academic excellence certificate for performance in the Board Exam

Things that interest me

Timelapse Videos
Images: got so many likes..

Resume