Beacon Scanner

Published: 2022-03-06 Original Prompt

Part 1

This is one I avoided as long as I could. It wasn’t one I had any idea how to approach. After spinning my wheels on it, I caved and checked out the solution thread. Turns out that the “easy” solution was to know linear algebra, which is something I’ve got absolutely no experience with. The tl;dr is that it’s a branch of mathematics that uses matrices to solve for x and y in sets of linear functions. For a reason I don’t quite grok, it’t super applicable to other engineering fields. Luckily, it didn’t end up being required today, just helpful. Let’s walk through this solution by /u/p88h to see how it’s solvable with good, old-fashioned programming.

First, input parsing! Not too bad today, just a little verbose. For each block (separated by \n\n), we need to split the line by a comma and turn each element into ints. Here’s my function with a big docstring and some concise parsing:

Point3D = Tuple[int, int, int]

class Solution:
    ...

    def parse_input(self) -> List[List[Point3D]]:
        """
        Parses each individual scanner block into a list of tuples

        This:

            --- scanner 0 ---
            404,-588,-901
            528,-643,409
            ...

            --- scanner 1 ---
            686,422,578
            605,423,415
            ...

        becomes:

            [
                [
                    (404, -588,-901),
                    (528, -643, 409),
                    ...
                ],
                [
                    (686, 422, 578),
                    (605, 423, 415),
                    ...
                ],
            ]
        """
        return [
            [tuple(map(int, line.split(","))) for line in block.split("\n")[1:]]
            for block in self.input.split("\n\n")
        ]

That in hand, let’s dig into the the hard part: we have to write a function that can tell if two scanners report overlapping points. The rotation made this seem really hard and, while it certainly complicates it, it’s not as bad as I thought. The key to the whole thing is that all of the points, no matter the rotation, will have the same offsets. So, if we count the distances between every combination of points reported by 2 scanners and there’s at least 12 that have the exact same offset in a given dimension, then we know how far away the scanners must be from each other. That sounds like a lot, but some examples should help clear it up.

Let’s imagine a simple case where there’s a line of beacons (B) between scanners S and T.

B....
.B...
S.B..
...BT

Scanner S reports the top 3 points as (top to bottom):

0,2
1,1
2,0
3,-1

While T reports them like so:

-4,3
-3,2
-2,1
-1,0

We don’t know where S and T are relative to each other. But, we know that if a beacon is reported by both scanners, then the difference between the two reported values will be the same across each matched beacon. We can see this in our example:

`S`’s `x` value	`T`’s `x` value	difference (`Sx - Tx`)
0	-4	4
1	-3	4
2	-2	4
3	-1	4

The same holds true for the y values:

`Sy`	`Ty`	difference
2	3	-1
1	2	-1
0	1	-1
-1	0	-1

Of course, the points aren’t sorted in our puzzle input. So, we have to compare differences between every possible pair of Sx and Tx (using a nested for loop, or the itertools.product function). Here’s the full combination table for the x values:

`Sx`	`Tx`	difference
0	-4	4
0	-3	3
0	-2	2
0	-1	1
1	-4	5
1	-3	4
1	-2	3
1	-1	2
2	-4	6
2	-3	5
2	-2	4
2	-1	3
3	-4	7
3	-3	6
3	-2	5
3	-1	4

If we run the difference column through a Counter, we get:

Counter({
    # difference : # occurrences
    4: 4,
    3: 3,
    5: 3,
    2: 2,
    6: 2,
    1: 1,
    7: 1
})

This tells us that for x values, there are 4 overlapping beacons. In the actual puzzle, we need 12, but this validates that our initial plan works! We see similar results with the y offsets counter:

Counter({
    -1: 4,
     0: 3,
    -2: 3,
     1: 2,
    -3: 2,
     2: 1,
    -4: 1
})

It’s worth noting that this approach depends on the fact that each beacon is unique in each dimension (an assumption we can verify in the example output). I suspect this approach wouldn’t be viable with a random dataset; luckily, we don’t have one of those!

Most importantly, it tells us that the T scanner is at point 4, -1 if S is at (0, 0). And that’s true!

Of course, there’s another wrinkle- the rotation/flipping of each scanner. In our last example, S and T were oriented the same way. But what about when they’re not? If we rotate T 90 degrees, suddenly the world looks very different. The original map:

B....
.B...
S.B..
...BT

becomes:

.S.B
..B.
.B..
B...
T...

when T reports it. It now announces the beacons as:

0,1
1,2
2,3
3,4

If we compare the x values like we did before, the differences are all 0! That can’t be right; that means the points are on top of each other. Instead, we have to change which columns we’re comparing to each other. The column that S sees as +x (going to the right) becomes -y (going down). What before was +y (up) now becomes +x (right). It helped me to picture it like a clock:

+y
 ^
 |
 |
 |---> +x

when rotated 90 degrees clockwise, becomes:

 |---> +x
 |
 |
 V
-y

As a result, our comparison tables must change:

`Sx`	`Ty * -1`	difference
0	-1	1
0	-2	2
0	-3	3
0	-4	4
1	-1	2
1	-2	3
1	-3	4
1	-4	5
2	-1	3
2	-2	4
2	-3	5
2	-4	6
3	-1	4
3	-2	5
3	-3	6
3	-4	7

Which counts up to:

Counter({4: 4, 3: 3, 5: 3, 2: 2, 6: 2, 1: 1, 7: 1})

This tells us that, accounting for rotation, T is still 4 from S. Comparing the Ty column to Sx yields the same result. Knowing that, we can re-orient all the points reported by T and express them using S as the basis. Now, the points that both scanners report will be identical numbers, and we can de-duplicate them! This is the key to the puzzle.

Astute readers will notice we’ve gone quite a while without actually writing any code; let’s remedy that! Now that we sort of know what we need to do, I’m going to swap over to the actual example (where scanner 0 starts with 404,-588,-901) instead of my little one. We should be developing against our actual input to make sure our solution will solve the puzzle, not my contrived example.

We’ll now write a function that’ll take a pair of scanners (List[Point3D]) and returns both:

the points of the second scanner, oriented in terms of the first
the location of the second scanner relative to the first

And we do that only if we find at least 12 overlapping points (per the prompt). We’ll need to compare each column of our source scanner with each column (including the inverses) of our potential overlap to search for our common offsets. Because we’re calculating each column independently, we’ll store them that way too (and collect the results at the bottom). Let’s step through it.

def align_scanner_reports(
    source_scanner: List[Point3D], target_scanner: List[Point3D]
) -> Tuple[List[Point3D], Optional[Point3D]]:
    # if it's a match, we'll build 3 lists of adjusted values
    aligned_target_values: List[List[int]] = []
    # the offset between the source and possible match, if matched
    target_coordinates: List[int] = []
    # once we've matched a destination column, skip it
    found_dims: Set[int] = set()

    # check each of x, y, z values independently
    for source_dim in range(3):
        # pre-assign anything we'll set in the loops below
        target_dim_values: List[int] = []
        num_shared_beacons = -1
        difference = target_dim = None

        source_dim_values = [b[source_dim] for b in source_scanner]
        for (signed_int, target_dim) in product((1, -1), range(3)):
            # bail early if we have already seen a successful match in this dimension
            if target_dim in found_dims:
                continue

            target_dim_values = [b[target_dim] * signed_int for b in target_scanner]
            differences = [
                source_val - target_val
                for source_val, target_val in product(
                    source_dim_values, target_dim_values
                )
            ]

            difference, num_shared_beacons = Counter(differences).most_common(1)[0]
            if num_shared_beacons >= 12:
                # found a valid match, stop checking other transforms
                break

There are a lot of variables there, but by-and-large, we’re doing just as we discussed. We start with the list of x values from the source scanner ([404, 528, -838, ...]), then compare that list to each column (both positive and negative) for the target scanner. If there are more than 12 overlaps, we’ve found a match! We break and continue processing. This block makes extensive use of the itertools.product function, which is a shorthand for a nested for loop. Let’s continue:

def align_scanner_reports(...):
    ...

    # check each of x, y, z values independently
    for source_dim in range(3):
        ...

        # failed to find sufficient overlap between these scanners
        if num_shared_beacons < 12:
            return [], tuple()

        # these all hold their last value from the loop when we break
        # they need to have been set to non-initial values
        assert target_dim_values
        assert difference is not None
        assert target_dim is not None

        found_dims.add(target_dim)
        aligned_target_values.append(
            [target_val + difference for target_val in target_dim_values]
        )
        target_coordinates.append(difference)

Once we’ve exited the for source_dim... loop, we check if num_shared_beacons is greater than the threshold (because we might reach this code after exhausting every option, not breaking early). If so, we can safely calculate the actual value for each beacon if this dimension. Ultimately, it ends up being some simple math:

When making guesses, we store source - target as difference (D = S - T). Once we know the difference is the correct one, we can reverse that to get the true location of every target value- it’s the difference plus the target (S = D + T).

Once we’ve done that 3 times, we can do our actual return:

def align_scanner_reports(...):
    ...

    # check each of x, y, z values independently
    for source_dim in range(3):
        ...

    # after finding matches in all 3 dimensions, we have a new set of aligned points
    assert len(aligned_target_values) == 3
    assert len(target_coordinates) == 3

    points = cast(List[Point3D], list(zip(*aligned_target_values)))
    return points, tuple(target_coordinates)

We line up our list of x values with our list of y and z and zip them into actual Point3Ds. Nice job!

All that remains is to write the function that actually manages the scanner lists. Here’s how we set that up:

scanners = self.parse_input()
beacons: Set[Point3D] = set()

# scanner 0 is always aligned
aligned_scanner_reports = [scanners[0]]
unaligned_scanners = scanners[1:]
scanner_locations: List[Point3D] = [(0, 0, 0)]

The overall flow is:

for each aligned scanner, try to align every other un-aligned scanner to it
If it’s a hit, great! Now you have another aligned scanner
Otherwise, drop the failed match into a “circle back to this” bucket
Once you’ve checked all un-aligned scanners, add your source’s beacons to the set of all beacons (using a set handles de-duplication for us)

To start, only scanner 0 is aligned (because it is the basis by which all other scanners find alignment). Here’s that code:

while aligned_scanner_reports:
    source = aligned_scanner_reports.pop()
    needs_recheck = []
    for target in unaligned_scanners:
        # 1.
        aligned_target, scanner_location = align_scanner_reports(source, target)

        if not (aligned_target and scanner_location):
           # 3.
            needs_recheck.append(target)
            continue

        scanner_locations.append(scanner_location)
        # 2.
        aligned_scanner_reports.append(aligned_target)

    unaligned_scanners = needs_recheck
    # 4.
    beacons.update(source)

So, each scanner is used as source exactly once. We check it against any unaligned scanners (with the assumption that each scanner has at least 1 overlap).

Finally, return len(beacons) gets us our answer!

Part 2

Mercifully, our previous work to find scanner locations gives us a fairly easy time in part 2. The manhattan distance between a point (Ax, Ay, Az) and another (Bx, By, Bz) is the sum of their differences in each dimension. So:

abs(Ax - Bx) + abs(Ay - By) + abs(Az - Bz)

That’s the total distance to get from each of those points to another. To find the greatest such value between any two points, we have to run every pair of points through that equation and take the biggest. Sounds like a job for product! Luckily, we have a full list of the scanner_locations already, so it comes down to a little nested list comprehension:

farthest_scanner = max(
   sum(abs(a_dim - b_dim) for a_dim, b_dim in zip(A, B))
   for A, B in product(scanner_locations, repeat=2)
)

And that should do it!

Big kudos for making it all the way down here today. Looking back, I’m pretty sure this was the single hardest day (for me). I feel like I learned a lot about how to visualize 3D space though!

`Sx`	`Tx`	difference
0	-4	4
0	-3	3
0	-2	2
0	-1	1
1	-4	5
1	-3	4
1	-2	3
1	-1	2
2	-4	6
2	-3	5
2	-2	4
2	-1	3
3	-4	7
3	-3	6
3	-2	5
3	-1	4

`Sx`	`Ty * -1`	difference
0	-1	1
0	-2	2
0	-3	3
0	-4	4
1	-1	2
1	-2	3
1	-3	4
1	-4	5
2	-1	3
2	-2	4
2	-3	5
2	-4	6
3	-1	4
3	-2	5
3	-3	6
3	-4	7

`Sx`	`Tx`	difference
0	-4	4
0	-3	3
0	-2	2
0	-1	1
1	-4	5
1	-3	4
1	-2	3
1	-1	2
2	-4	6
2	-3	5
2	-2	4
2	-1	3
3	-4	7
3	-3	6
3	-2	5
3	-1	4

`Sx`	`Ty * -1`	difference
0	-1	1
0	-2	2
0	-3	3
0	-4	4
1	-1	2
1	-2	3
1	-3	4
1	-4	5
2	-1	3
2	-2	4
2	-3	5
2	-4	6
3	-1	4
3	-2	5
3	-3	6
3	-4	7

`Sx`	`Tx`	difference
0	-4	4
0	-3	3
0	-2	2
0	-1	1
1	-4	5
1	-3	4
1	-2	3
1	-1	2
2	-4	6
2	-3	5
2	-2	4
2	-1	3
3	-4	7
3	-3	6
3	-2	5
3	-1	4

`Sx`	`Ty * -1`	difference
0	-1	1
0	-2	2
0	-3	3
0	-4	4
1	-1	2
1	-2	3
1	-3	4
1	-4	5
2	-1	3
2	-2	4
2	-3	5
2	-4	6
3	-1	4
3	-2	5
3	-3	6
3	-4	7