<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Suresh Venkatasubramanian &#187; Papers</title>
	<atom:link href="http://www.cs.utah.edu/~suresh/web/category/papers/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cs.utah.edu/~suresh/web</link>
	<description></description>
	<lastBuildDate>Mon, 23 Sep 2013 07:42:03 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Clustering with center constraints</title>
		<link>http://www.cs.utah.edu/~suresh/web/2013/09/17/clustering-with-center-constraints/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2013/09/17/clustering-with-center-constraints/#comments</comments>
		<pubDate>Tue, 17 Sep 2013 18:29:15 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=453</guid>
		<description><![CDATA[Parinya Chalermsook and Suresh Venkatasubramanian In IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTICS), 2013 Abstract: In the classical maximum independent set problem, we are given a graph G of “conflicts” and are asked to find a maximum conflict-free subset. If we think of the remaining nodes as being “assigned” [...]]]></description>
				<content:encoded><![CDATA[<p>Parinya Chalermsook and Suresh Venkatasubramanian</p>
<p>In<a href="http://www.fsttcs.org/"> IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science</a> (FSTTICS), 2013</p>
<p><span id="more-453"></span></p>
<p><strong>Abstract</strong>:</p>
<div title="Page 1">
<p>In the classical maximum independent set problem, we are given a graph G of “conflicts” and are asked to find a maximum conflict-free subset. If we think of the remaining nodes as being “assigned” (at unit cost each) to one of these independent vertices and ask for an assignment of minimum cost, this yields the vertex cover problem. In this paper, we consider a more general scenario where the assignment costs might be given by a distance metric d (which can be unrelated to G) on the underlying set of vertices. We call this problem minimum edge- weighted independent set. This problem, in addition to being a natural generalization of vertex cover and an interesting variant of matroid median problem, also has connection to constrained clustering and database repair.</p>
<p>Understanding the relation between the conflict structure (the graph) and the distance structure (the metric) for this problem turns out to be the key to isolating its complexity. We show that when the two structures are unrelated, the problem inherits a trivial upper bound from vertex cover and provide an almost matching lower bound on hardness of approximation. We then prove a number of lower and upper bounds that depend on the relationship between the two structures, including polynomial time algorithms for special graphs.</p>
<p>Links: <a href="http://www.cs.utah.edu/~suresh/papers/mwis/mwis.pdf">PDF</a></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2013/09/17/clustering-with-center-constraints/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Power to the points: validating data memberships in clusterings</title>
		<link>http://www.cs.utah.edu/~suresh/web/2013/09/17/power-to-the-points-local-certificates-for-clustering/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2013/09/17/power-to-the-points-local-certificates-for-clustering/#comments</comments>
		<pubDate>Tue, 17 Sep 2013 16:18:37 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[CCF 0953066]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=448</guid>
		<description><![CDATA[[author]Parasaran Raman and Suresh Venkatasubramanian[/author] Proc. IEEE International Conference on Data Mining, 2013 (ICDM) Abstract: In this paper, we present a method to attach affinity scores to the implicit labels of individual points in a clustering. The affinity scores capture the confidence level of the cluster that claims to &#8220;own&#8221; the point. We demonstrate that [...]]]></description>
				<content:encoded><![CDATA[<p>[author]Parasaran Raman and Suresh Venkatasubramanian[/author]</p>
<p><a href="http://icdm2013.rutgers.edu/">Proc. IEEE International Conference on Data Mining, 2013 (ICDM)</a></p>
<p><span id="more-448"></span></p>
<p><strong>Abstract</strong>:</p>
<p>In this paper, we present a method to attach affinity scores to the implicit labels of individual points in a clustering. The affinity scores capture the confidence level of the cluster that claims to &#8220;own&#8221; the point. We demonstrate that these scores accurately capture the quality of the label assigned to the point. We also show further applications of these scores to estimate <em>global</em> measures of clustering quality, as well as accelerate clustering algorithms by orders of magnitude using active selection based on affinity.</p>
<p>This method is very general and applies to clusterings derived from any geometric source. It lends itself to easy visualization and can prove useful as part of an interactive visual analytics framework. It is also efficient: assigning an affinity score to a point depends only polynomially on the number of clusters <em>and is independent both of the size and dimensionality of the data</em>. It is based on techniques from the theory of interpolation, coupled with sampling and estimation algorithms from high dimensional computational geometry.</p>
<p><iframe style="border: 0px none transparent;" src="http://www.ustream.tv/embed/recorded/38896271?v=3&amp;wmode=direct" height="302" width="480" frameborder="0" scrolling="no"></iframe></p>
<p><a style="padding: 2px 0px 4px; width: 400px; background: #ffffff; display: block; color: #000000; font-weight: normal; font-size: 10px; text-decoration: underline; text-align: center;" href="http://www.ustream.tv/" target="_blank">Video streaming by Ustream</a></p>
<p>Links: (<a href="http://arxiv.org/abs/1305.4757">older arXiv version</a>: <a href="http://www.cs.utah.edu/~suresh/papers/stable/stable.pdf">submitted version</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2013/09/17/power-to-the-points-local-certificates-for-clustering/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Moving heaven and earth: distances between distributions</title>
		<link>http://www.cs.utah.edu/~suresh/web/2013/09/16/moving-heaven-and-earth-distances-between-distributions/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2013/09/16/moving-heaven-and-earth-distances-between-distributions/#comments</comments>
		<pubDate>Mon, 16 Sep 2013 21:17:02 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[Column]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=444</guid>
		<description><![CDATA[[author]Suresh Venkatasubramanian[/author] SIGACT News vol 44, no. 3. Abstract: This column comes in two parts. In the first, I discuss various ways of defining distances between distributions. In the second, Jeff Erickson (chair of the SoCG steering committee) discusses some matters related to the relationship between ACM and the Symposium on Computational Geometry. Links: PDF]]></description>
				<content:encoded><![CDATA[<p>[author]Suresh Venkatasubramanian[/author]<br />
SIGACT News vol 44, no. 3.<br />
<span id="more-444"></span></p>
<p>Abstract:</p>
<div title="Page 1">
<div>
<div>
<p>This column comes in two parts. In the first, I discuss various ways of defining distances between distributions. In the second, Jeff Erickson (chair of the SoCG steering committee) discusses some matters related to the relationship between ACM and the Symposium on Computational Geometry.</p>
<p>Links: <a href="http://www.cs.utah.edu/~suresh/papers/column/kernel/kernel.pdf">PDF</a></p>
</div>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2013/09/16/moving-heaven-and-earth-distances-between-distributions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Developments in Matrix Factorization.</title>
		<link>http://www.cs.utah.edu/~suresh/web/2013/03/13/new-developments-in-matrix-factorization/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2013/03/13/new-developments-in-matrix-factorization/#comments</comments>
		<pubDate>Thu, 14 Mar 2013 04:33:18 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[Column]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=404</guid>
		<description><![CDATA[SIGACT News 44 (1), March 2013. Notes: the published version of the article has a few errors. Moitra&#8217;s SODA paper improves the running time from $O((nm)^{2^r r^2})$ to $O((nm)^{r^2})$, which is considerably stronger than what was reported in the article. The correct lower bound for computing a nonnegative factorization (assuming ETH) is $O((nm)^{o(r)})$. This link [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://dl.acm.org/citation.cfm?doid=2447712.2447732">SIGACT News 44 (1), March 2013.</a></p>
<p>Notes: the published version of the article has a few errors.</p>
<ul>
<li>Moitra&#8217;s SODA paper improves the running time from $O((nm)^{2^r r^2})$ to $O((nm)^{r^2})$, which is considerably stronger than what was reported in the article.</li>
<li>The correct lower bound for computing a nonnegative factorization (assuming ETH) is $O((nm)^{o(r)})$.</li>
</ul>
<p>This link (<a href="http://www.cs.utah.edu/~suresh/papers/column/nmf/nmf-mod.pdf">PDF</a>) has the corrected article. Thanks to Ankur Moitra for pointing out the errors.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2013/03/13/new-developments-in-matrix-factorization/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Multiple Target Tracking with RF Sensor Networks</title>
		<link>http://www.cs.utah.edu/~suresh/web/2013/01/24/multiple-target-tracking-with-rf-sensor-networks/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2013/01/24/multiple-target-tracking-with-rf-sensor-networks/#comments</comments>
		<pubDate>Thu, 24 Jan 2013 07:18:32 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[CPS 1035565]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=336</guid>
		<description><![CDATA[[author]Maurizio Bocca, Ossi Kaltiokallio, Neal Patwari and Suresh Venkatasubramanian.[/author] To appear in the IEEE Transactions on Mobile Computing. http://arxiv.org/abs/1302.4720 &#160; Abstract: RF sensor networks are wireless networks that can localize and track people (or targets) without needing them to carry or wear any electronic device. They use the change in the received signal strength (RSS) [...]]]></description>
				<content:encoded><![CDATA[<p>[author]Maurizio Bocca, Ossi Kaltiokallio, Neal Patwari and Suresh Venkatasubramanian.[/author]</p>
<p><em>To appear in the <a href="http://www.computer.org/portal/web/tmc">IEEE Transactions on Mobile Computing</a>.</em></p>
<p><em><a href="http://arxiv.org/abs/1302.4720">http://arxiv.org/abs/1302.4720</a></em></p>
<p>&nbsp;</p>
<p><span id="more-336"></span></p>
<p><strong>Abstract:</strong></p>
<div title="Page 1">
<blockquote><p>RF sensor networks are wireless networks that can localize and track people (or targets) without needing them to carry or wear any electronic device. They use the change in the received signal strength (RSS) of the links due to the movements of people to infer their locations. In this paper, we consider real-time multiple target tracking with RF sensor networks. We perform radio tomographic imaging (RTI), which generates images of the change in the propagation field, as if they were frames of a video. Our RTI method uses RSS measurements on multiple frequency channels on each link, combining them with a fade level-based weighted average. We describe methods to adapt machine vision methods to the peculiarities of RTI to enable real time multiple target tracking. Several tests are performed in an open environment, a one-bedroom apartment, and a cluttered office environment. The results demonstrate that the system is capable of accurately tracking in real-time up to 4 targets in cluttered indoor environments, even when their trajectories intersect multiple times, without mis-estimating the number of targets found in the monitored area. The highest average tracking error measured in the tests is 0.45 m with two targets, 0.46 m with three targets, and 0.55 m with four targets.</p></blockquote>
</div>
<p>Links: Coming soon.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2013/01/24/multiple-target-tracking-with-rf-sensor-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sensor Network Localization for Moving Sensors</title>
		<link>http://www.cs.utah.edu/~suresh/web/2012/10/15/sensor-network-localization-for-moving-sensors/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2012/10/15/sensor-network-localization-for-moving-sensors/#comments</comments>
		<pubDate>Mon, 15 Oct 2012 17:44:24 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[CCF 0953066]]></category>
		<category><![CDATA[CCF 1115677]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=287</guid>
		<description><![CDATA[[author]Arvind Agarwal, Hal Daume III, Jeff M. Phillips, Suresh Venkatasubramanian[/author] The Second IEEE ICDM Workshop on Data Mining in Networks Abstract: Sensor network localization (SNL) is the problem of determining the locations of the sensors given sparse and usually noisy inter-communication distances among them. In this work we propose an iterative algorithm named PLACEMENT to [...]]]></description>
				<content:encoded><![CDATA[<p>[author]Arvind Agarwal, Hal Daume III, Jeff M. Phillips, Suresh Venkatasubramanian[/author]<br />
<em><a href="http://damnet.reading.ac.uk/">The Second IEEE ICDM Workshop on Data Mining in Networks</a></em></p>
<p><span id="more-287"></span></p>
<p>Abstract:</p>
<blockquote><p>Sensor network localization (SNL) is the problem of determining the locations of the sensors given sparse and usually noisy inter-communication distances among them. In this work we propose an iterative algorithm named PLACEMENT to solve the SNL problem.<br />
This iterative algorithm requires an initial estimation of the locations and in each iteration, is guaranteed to reduce the cost function. The proposed algorithm is able to take advantage of the good initial estimation of sensor locations making it suitable for localizing moving sensors, and also suitable for the reﬁnement of the results produced by other algorithms. Our algorithm is very scalable. We have<br />
experimented with a variety of sensor networks and have shown that the proposed algorithm outperforms existing algorithms both in terms of speed and accuracy in almost all experiments. Our algorithm can embed 120,000 sensors in less than 20 minutes.</p>
</blockquote>
<p>Links: <a href="http://www.cs.utah.edu/~suresh/papers/damnet/damnet.pdf">PDF</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2012/10/15/sensor-network-localization-for-moving-sensors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Radio Tomographic Imaging and Tracking of Stationary and Moving People via Histogram Difference</title>
		<link>http://www.cs.utah.edu/~suresh/web/2012/07/18/radio-tomographic-imaging-and-tracking-of-stationary-and-moving-people-via-histogram-difference/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2012/07/18/radio-tomographic-imaging-and-tracking-of-stationary-and-moving-people-via-histogram-difference/#comments</comments>
		<pubDate>Wed, 18 Jul 2012 16:05:22 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[CPS 1035565]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=283</guid>
		<description><![CDATA[[author]Yang Zhao, Neal Patwari, Jeff Phillips and Suresh Venkatasubramanian[/author] IPSN, 2013 Abstract: Device-free localization systems pinpoint and track people in buildings using changes in the signal strength measurements made on wireless devices in the building&#8217;s wireless network. It has been shown that such systems can locate people who do not participate in the system by [...]]]></description>
				<content:encoded><![CDATA[<p>[author]Yang Zhao, Neal Patwari, Jeff Phillips and Suresh Venkatasubramanian[/author]<br />
<a href="http://ipsn.acm.org/2013/"><em>IPSN, 2013</em></a></p>
<p><span id="more-283"></span><br />
<strong>Abstract</strong>:</p>
<blockquote><p>Device-free localization systems pinpoint and track people in buildings using changes in the signal strength measurements made on wireless devices in the building&#8217;s wireless network. It has been shown that such systems can locate people who do not participate in the system by wearing any radio device, even through walls, because of the changes that moving people cause to the static wireless network. However, many such systems cannot locate stationary people. We present and evaluate a system which can locate stationary or moving people, with or without calibration, by quantifying the difference between two histograms of signal strength measurements. From five experiments, we show that our kernel distance-based radio tomographic localization system performs better than the state-of-the-art device-free localization systems in different non line-of-sight environments.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2012/07/18/radio-tomographic-imaging-and-tracking-of-stationary-and-moving-people-via-histogram-difference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Efficient Protocols for Distributed Classification and Optimization</title>
		<link>http://www.cs.utah.edu/~suresh/web/2012/04/16/efficient-protocols-for-distributed-classification-and-optimization/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2012/04/16/efficient-protocols-for-distributed-classification-and-optimization/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 02:25:28 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[CCF 0953066]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=275</guid>
		<description><![CDATA[[author]Hal Daume III, Jeff M. Phillips, Avishek Saha, Suresh Venkatasubramanian[/author] Proc. 23rd International Conference on Algorithmic Learning Theory (ALT), 2012. arXiv:1204.3523v1 [cs.LG] Abstract: In distributed learning, the goal is to perform a learning task over data distributed across multiple nodes with minimal (expensive) communication. Prior work (Daume III et al., 2012) proposes a general model [...]]]></description>
				<content:encoded><![CDATA[<p>[author]Hal Daume III, Jeff M. Phillips, Avishek Saha, Suresh Venkatasubramanian[/author]<br />
<a href="http://www-alg.ist.hokudai.ac.jp/~thomas/ALT12/index.html">Proc. 23rd International Conference on Algorithmic Learning Theory (ALT), 2012.</a><br />
<a href="http://arxiv.org/abs/1204.3523">arXiv:1204.3523v1 [cs.LG]</a></p>
<p><span id="more-275"></span><br />
<strong>Abstract:</strong></p>
<blockquote><p>In distributed learning, the goal is to perform a learning task over data distributed across multiple nodes with minimal (expensive) communication. Prior work (Daume III et al., 2012) proposes a general model that bounds the communication required for learning classifiers while allowing for $\eps$ training error on linearly separable data adversarially distributed across nodes.</p>
<p>In this work, we develop key improvements and extensions to this basic model. Our first result is a two-party multiplicative-weight-update based protocol that uses $O(d^2 \log{1/\eps})$ words of communication to classify distributed data in arbitrary dimension $d$, $\eps$-optimally. This readily extends to classification over $k$ nodes with $O(kd^2 \log{1/\eps})$ words of communication. Our proposed protocol is simple to implement and is considerably more efficient than baselines compared, as demonstrated by our empirical results.<br />
In addition, we illustrate general algorithm design paradigms for doing efficient learning over distributed data. We show how to solve fixed-dimensional and high dimensional linear programming efficiently in a distributed setting where constraints may be distributed across nodes. Since many learning problems can be viewed as convex optimization problems where constraints are generated by individual points, this models many typical distributed learning scenarios. Our techniques make use of a novel connection from multipass streaming, as well as adapting the multiplicative-weight-update framework more generally to a distributed setting. As a consequence, our methods extend to the wide range of problems solvable using these techniques. </p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2012/04/16/efficient-protocols-for-distributed-classification-and-optimization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>On minimizing budget and time in influence propagation over social networks</title>
		<link>http://www.cs.utah.edu/~suresh/web/2012/03/21/on-minimizing-budget-and-time-in-influence-propagation-over-social-networks/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2012/03/21/on-minimizing-budget-and-time-in-influence-propagation-over-social-networks/#comments</comments>
		<pubDate>Thu, 22 Mar 2012 06:25:02 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=418</guid>
		<description><![CDATA[Amit Goyal, Francesco Bonchi, Laks V. S. Lakshmanan, Suresh Venkatasubramanian Social Network Analysis and Mining, Mar 2012. Abstract: In recent years, study of influence propagation in social networks has gained tremendous attention. In this context, we can identify three orthogonal dimensions—the number of seed nodes activated at the beginning (known as budget), the expected number of activated nodes at the end [...]]]></description>
				<content:encoded><![CDATA[<p>Amit Goyal, Francesco Bonchi, Laks V. S. Lakshmanan, Suresh Venkatasubramanian<br />
<a href="http://link.springer.com/article/10.1007%2Fs13278-012-0062-z">Social Network Analysis and Mining</a>, Mar 2012.</p>
<p><span id="more-418"></span></p>
<p>Abstract:</p>
<p>In recent years, study of influence propagation in social networks has gained tremendous attention. In this context, we can identify three orthogonal dimensions—the number of <em>seed</em> nodes activated at the beginning (known as <em>budget</em>), the expected number of activated nodes at the end of the propagation (known as <em>expected spread</em> or <em>coverage</em>), and the <em>time</em> taken for the propagation. We can constrain one or two of these and try to optimize the third. In their seminal paper, Kempe et al. constrained the budget, left time unconstrained, and maximized the coverage: this problem is known as <em>Influence Maximization</em> (or MAXINF for short). In this paper, we study alternative optimization problems which are naturally motivated by resource and time constraints on viral marketing campaigns. In the first problem, termed <em>minimum target set selection</em> (or MINTSS for short), a coverage threshold η is given and the task is to find the <em>minimum size seed set</em> such that by activating it, at least η nodes are eventually activated in the expected sense. This naturally captures the problem of deploying a viral campaign on a budget. In the second problem, termed MINTIME, the goal is to minimize the time in which a predefined coverage is achieved. More precisely, in MINTIME, a coverage threshold η and a budget threshold <em>k</em> are given, and the task is to find a seed set of size at most <em>k</em> such that by activating it, at least η nodes are activated in the expected sense, <em>in the minimum possible time</em>. This problem addresses the issue of <em>timing</em> when deploying viral campaigns. Both these problems are <strong>NP</strong>-hard, which motivates our interest in their approximation. For MINTSS, we develop a simple greedy algorithm and show that it provides a bicriteria approximation. We also establish a generic hardness result suggesting that improving this bicriteria approximation is likely to be hard. For MINTIME, we show that even bicriteria and tricriteria approximations are hard under several conditions. We show, however, that if we allow the budget for number of seeds <em>k</em> to be boosted by a logarithmic factor and allow the coverage to fall short, then the problem can be solved <em>exactly</em> in PTIME, i.e., we can achieve the required coverage within the time achieved by the optimal solution to MINTIME with budget <em>k</em> and coverage threshold η. Finally, we establish the value of the approximation algorithms, by conducting an experimental evaluation, comparing their quality against that achieved by various heuristics.</p>
<p>Links: <a href="http://link.springer.com/article/10.1007%2Fs13278-012-0062-z">Journal site</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2012/03/21/on-minimizing-budget-and-time-in-influence-propagation-over-social-networks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Protocols for Learning Classifiers on Distributed Data</title>
		<link>http://www.cs.utah.edu/~suresh/web/2011/12/12/protocols-for-learning-classifiers-on-distributed-data/</link>
		<comments>http://www.cs.utah.edu/~suresh/web/2011/12/12/protocols-for-learning-classifiers-on-distributed-data/#comments</comments>
		<pubDate>Mon, 12 Dec 2011 21:53:20 +0000</pubDate>
		<dc:creator>suresh</dc:creator>
				<category><![CDATA[Papers]]></category>
		<category><![CDATA[CCF 0953066]]></category>

		<guid isPermaLink="false">http://www.cs.utah.edu/~suresh/web/?p=260</guid>
		<description><![CDATA[[author]Hal Daumé, Jeff M. Phillips, Avishek Saha and Suresh Venkatasubramanian[/author] In the 15th International Conference on Artificial Intelligence and Statistics (AISTATS), 2012. Abstract: We consider the problem of learning classifiers for labeled data that has been distributed across several nodes. Our goal is to find a single classifier, with small approximation error, across all datasets [...]]]></description>
				<content:encoded><![CDATA[<p>[author]Hal Daumé, Jeff M. Phillips, Avishek Saha and Suresh Venkatasubramanian[/author]<br />
In the <a href="http://www.aistats.org/">15th International Conference on Artificial Intelligence and Statistics</a> (AISTATS), 2012.</p>
<p><span id="more-260"></span><br />
<strong>Abstract:</strong></p>
<p>We consider the problem of learning classifiers for labeled data that has been distributed across several nodes. Our goal is to find a single classifier, with small approximation error, across all datasets while minimizing the communication between nodes. This setting models real-world communication bottlenecks in the processing of massive distributed datasets.  We present several very general sampling-based solutions as well as some two-way protocols which have a provable exponential speed-up over any one-way protocol. We focus on core problems for <em>noiseless</em> data distributed across two or more nodes. The techniques we introduce are reminiscent of active learning, but rather than actively probing labels, nodes actively communicate with each other, each node simultaneously learning the important data from another node. </p>
<p>Links: <a href="http://www.cs.utah.edu/~suresh/papers/active/active.pdf">PDF </a>(this is the submitted version, not the final accepted version)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cs.utah.edu/~suresh/web/2011/12/12/protocols-for-learning-classifiers-on-distributed-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>