Handy Haversacks

Published: 2020-12-08 Original Prompt

Part 1

All right, we’re really getting into it now.

The question we ultimately need to answer for part 1 is:

given a bag color, can it (or any of the bags it can hold) hold “shiny gold”?

First, we have to model this data. For part 1 we don’t need to worry about the bag quantities, so we can ignore them.

Luckily, each line follows a partially predictable pattern:

The first part of the line is regular enough that we can pull it out:

for line in self.input:
    self.mapping = {}
    # light red bags contain 1 bright white bag, 2 muted yellow bags.

    # this_bag holds these_bags
    this_bag, held_bags = s.split(' bags contain ')

    if 'no other bags' in held_bags:
        pass

    # ['light red', '1 bright white bag, 2 muted yellow bags']

    held_bags = held_bags.split(', ')

    # ['1 bright white bag', '2 muted yellow bags']

    these_bags = []
    # just grab the adjective and color
    for bag in held_bags:
        bag_parts = bag.split(' ')
        these_bags.append(' '.join(bag_parts[1:3]))

    # ['bright white', 'muted yellow']

    self.mapping[this_bag] = these_bags

Now that we’ve got the parts split out, we need to store it. A string that maps to a list of string sounds a lot like a Dict[str, List[str]] to me! Let’s store that as self.mapping (if your solution is in a class, like mine. Otherwise, declare mapping = {} at the very top of the file).

Now that we’ve got our dict, we need to check what each bag holds all the way down until we get to bags that don’t hold anything (such as faded blue in the example).

Whenever you call a function repeatedly until you reach a “base case” is textbook recursion. This can be an intimidating subject, but as long as you follow a couple of simple rules, you’ll be in great shape!

Rule 1: always start with your base case

In this function, we want to stop recursing if we definitively know whether or not the current bag can hold the gold bag:

def can_hold_gold(self, bag) -> bool:
    if self.mapping[bag] == []:
        return False # definitely no
    if 'shiny gold' in self.mapping[bag]:
        return True # definitely yes

Assuming there’s not a loop in the bags (red holds blue, which holds red), we’ll eventually reach one of these two paths and our function will stop recursing. Awesome.

Rule 2: return a calculation involving the function

Now the heady part. Given a bag that we don’t have a definitive answer on, how do we calculate it?

We need to check if any bag that our bag holds can in turn hold shiny gold. So, we should return True if any bag in self.mapping[bag] returns true. Easy enough in code:

...
return any([self.can_hold_gold(color) for color in self.mapping[bag]])

With those two sections, we can call can_hold_gold and count the number of times it returns True:

return [self.can_hold_gold(bag) for bag in self.mapping].count(True)

All together, my solution ran in 0.363 seconds. Not too shabby!

Part 1: Extra Credit

But, we can do better with the magic of ✨Dynamic Programming✨. If that doesn’t tickle your fancy, feel free to jump ahead to Part 2. Otherwise, read on.

Let’s take a closer look at the first two lines of the example:

light red bags contain 1 bright white bag, 2 muted yellow bags. dark orange bags contain 3 bright white bags, 4 muted yellow bags.

After calculating whether bright yellow can hold shiny gold in the first line (it can), we get to the second line and… start calculating whether bright yellow can hold shiny gold! In the little example this isn’t a big deal, but in actual puzzle input, this could be very costly.

We can take advantage of the fact that a bag being able to hold shiny gold never changes. So once we know the answer, we can cache it in the mapping to speed up future checks of the same bag:

def can_hold_gold(self, bag) -> bool:
    if isinstance(self.mapping[bag], bool):
        return self.mapping[bag]

    if not self.mapping[bag]:
        res = False
    elif "shiny gold" in self.mapping[bag]:
        res = True
    else:
        res = any([self.can_hold_gold(color) for color in self.mapping[bag]])

    # cache results for later
    self.mapping[bag] = res
    return res

The first time a function is called for a bag, it does the full calculation for every bag in its tree. The next time any of those bags are checked (as part of any other bag), the response will be instant.

Now my solution runs in 0.016 seconds, a 22.7x improvement.

Breaking complex problems into small problems and caching the results is a technique called dynamic programming. It can lead to huge gains in the speed of recursive algorithms. A valuable tool to have in your toolbelt!

The complete part 1 solution is here. We change it in part 2, so use that link to see it before changes.

Part 2

Now that we have to start paying attention to the number of bags, we need a little more organization. To store that, we’ll use Python’s dataclasses, which allow for convenient storage of data:

from dataclasses import dataclass, field

@dataclass
class HoldInfo:
    color: str
    num: int


@dataclass
class BagInfo:
    color: str
    # shortcuts
    can_hold_gold: Optional[bool] = None
    num_bags_held: Optional[int] = None
    # which bags this holds. made obsolete by num_bags_held, once calculated
    held_bags: List[HoldInfo] = field(default_factory=list)

We can tweak our code from part 1 to use this new class without much effort:

bag_info = BagInfo('dark blue')

# empty bag
bag_info.can_hold_gold = False
bag_info.num_bags_held = 0

# full bag
bag_info.held_bags = [HoldInfo('dark blue', 3), ...]

Just like before, if bag_info.can_hold_gold is None, we need to calculate and store the result. Then, either way, we can return that value. We’ll take a similar approach with num_bags_held.

By default, the value of an empty bag is 1 (the bag itself). We add to that the value of each bag it holds * the quantity of that bag.

Concisely, that’s calculated as:

bag_info.num_bags_held = 1 + sum(
    [
        hold_info.num * self.num_bags_held(hold_info.color)
        for hold_info in bag_info.held_bags
    ]
)

In our root method, we can call self.value_of_bag('shiny gold') for our answer… almost.

This function returns the value of a bag relative to the one holding it. A sky blue bag holding a shiny gold one would be worth value(shiny gold) + 1. So if we want the value of the shiny gold bag by itself, we have to return self.value_of_bag("shiny gold") - 1. I was stuck on that bug for quite a while. But, we got there in the end.