There isn't a known analytic way to solve this problem, but we note that the problem can be broken down in 2 different dimensions. One break the problem down by weight by saying, "What if the thief could only carry 1 pound? What about 2 pounds? 3 pounds?" One could also break it down by items by saying, "What if the thief only knew about the first 2 items? first 3 items?" Now we know we can use dynamic programming to use those broken down solutions to find our desired, big solution for all values and maximum weight.
Base Case
Most dynamic programming problems, including this one, require a base case. In this problem, the base case is fairly intuitive. We know for sure that if the thief has a maximum weight of 0, then he can't get any value at all. We also know that if he has seen 0 items, then he also can't get any value. This is our starting point.
Recurrence Relation
This is the heavy part of the explanation, so follow closely. We're going to make a table to hold all of our little solutions - we'll call this table dp. We'll say that dp is two dimensional, like a basic Excel spreadsheet. Let's say the number of rows = the length of values[] + 1 (the +1 is to account for the base case) and the number of cols = maximum weight. Notice that this lines up with our supposing in the introductory paragraph, "What about 2,3,4 pounds? What about 1st, 2nd, 3rd items?" etc.
First we define what each entry in the table means. dp[i][j] for all i, j indices in the table is equal to the maximum value that the thief can obtain having looked at the first i items, and constrained by j maximum weight.
Now we have to form the recurrence relation which describes how we combine the subproblems into the big solution. As we look at each next item, if the item's weight is larger than j (our max weight), then we can't take it because it's too heavy - we have to leave it. If it's not larger than j, then we can either take it or leave it. If we take it, then we take the best solution we had for j - weight[i - 1], + the value of the item we took, such that takeIt = dp[i][j - weights[i - 1]] + values[i - 1]. Why weights[i - 1] and values[i - 1]? Because we augmented our table for the base case. If we leave it, then our best value is what we had the last time (see how we're tapping into our subproblems?) so dp[i][j] = dp[i - 1][j]. We want the maximum score between taking it and leaving it. So the algorithm is as follows:
is j >= weights[i - 1]?
yes:
takeIt = dp[i][j - weights[i - 1]] + values[i - 1];
leaveIt = dp[i - 1][j];
dp[i][j] = max(takeIt, leaveIt);
no:
dp[i][j] = dp[i - 1][j]
Pseudocode Solution
Supposing we have maximum weight set to 20, values as [20, 5, 6] and weights as [10, 5, 6]:
values = [20, 5, 6]
weights = [10, 5, 6]
MAX_WEIGHT = 20
dp = array[values.length + 1][MAX_WEIGHT + 1]
numRows = values.length + 1
numCols = MAX_WEIGHT + 1
# Loop through the rows and columns
for i in [0...numRows]:
for j in [0...numCols]:
# Establish the base case
if i == 0 or j == 0:
dp[i][j] = 0
continue;
# Use the recurrence relation
if j >= weights[i - 1]:
takeIt = dp[i - 1][j - weights[j - 1]] + values[i - 1]
leaveIt = dp[i - 1][j]
dp[i][j] = max(takeIt, leaveIt)
else:
dp[i][j] = dp[i - 1][j]
print "the maximum value the thief can get with his knapsack is " + dp[values.length][MAX_WEIGHT]
Seems like magic.