貪欲法の最適化問題

Question

"#こんな難しい問題初めてなんだが・・・
#貪欲法（だと思う）をすればいい。最初ちょっと混乱してしまった
#↓
#どうやらWrong Answerらしい。
#ちょっと考え直してみた
#とにかく難しい貪欲法.こんなムズイの？やばいな
#意味が分からん
from typing import List

def fair_jewelry_distribution(v: List[int]) -> str:
n = len(v)
idx = sorted(range(n), key=lambda i: -v[i])
best_asn = None
best_diff = None

for init in 'ABC':
asn = [''] * n
b_sum = c_sum = b_cnt = c_cnt = 0

i = idx[0]
asn[i] = init
if init == 'B':
b_sum += v[i]
b_cnt += 1
elif init == 'C':
c_sum += v[i]
c_cnt += 1

for i in idx[1:]:
if b_sum < c_sum:
asn[i] = 'B'
b_sum += v[i]
b_cnt += 1
else:
asn[i] = 'C'
c_sum += v[i]
c_cnt += 1

imp = True
while imp:
imp = False
best_local_diff = abs(b_sum - c_sum)
best_move = None

for i in range(n):
cur_owner = asn[i]
if cur_owner == 'B':
if c_cnt >= 1:
new_diff = abs((b_sum - v[i]) - (c_sum + v[i]))
if new_diff < best_local_diff and b_cnt > 1:
best_local_diff = new_diff
best_move = ('move', i, 'C')
new_diff = abs((b_sum - v[i]) - c_sum)
if new_diff < best_local_diff and b_cnt > 1:
best_local_diff = new_diff
best_move = ('move', i, 'A')
elif cur_owner == 'C':
if b_cnt >= 1:
new_diff = abs((b_sum + v[i]) - (c_sum - v[i]))
if new_diff < best_local_diff and c_cnt > 1:
best_local_diff = new_diff
best_move = ('move', i, 'B')
new_diff = abs(b_sum - (c_sum - v[i]))
if new_diff < best_local_diff and c_cnt > 1:
best_local_diff = new_diff
best_move = ('move', i, 'A')
else: # cur_owner == 'A'
new_diff = abs((b_sum + v[i]) - c_sum)
if new_diff < best_local_diff:
best_local_diff = new_diff
best_move = ('move', i, 'B')
new_diff = abs(b_sum - (c_sum + v[i]))
if new_diff < best_local_diff:
best_local_diff = new_diff
best_move = ('move', i, 'C')
for i in range(n):
for j in range(n):
if asn[i] != asn[j]:
o_i, o_j = asn[i], asn[j]
b_sum_new = b_sum
c_sum_new = c_sum
b_cnt_new = b_cnt
c_cnt_new = c_cnt
if o_i == 'B':
b_sum_new -= v[i]
b_cnt_new -= 1
elif o_i == 'C':
c_sum_new -= v[i]
c_cnt_new -= 1

if o_j == 'B':
b_sum_new -= v[j]
b_cnt_new -= 1
elif o_j == 'C':
c_sum_new -= v[j]
c_cnt_new -= 1
if o_j == 'B':
b_sum_new += v[i]
b_cnt_new += 1
elif o_j == 'C':
c_sum_new += v[i]
c_cnt_new += 1

if o_i == 'B':
b_sum_new += v[j]
b_cnt_new += 1
elif o_i == 'C':
c_sum_new += v[j]
c_cnt_new += 1
if b_cnt_new > 0 and c_cnt_new > 0:
new_diff = abs(b_sum_new - c_sum_new)
if new_diff < best_local_diff:
best_local_diff = new_diff
best_move = ('swap', i, j)

if best_move:
if best_move[0] == 'move':
_, i, new_owner = best_move
old_owner = asn[i]
if old_owner == 'B':
b_sum -= v[i]
b_cnt -= 1
elif old_owner == 'C':
c_sum -= v[i]
c_cnt -= 1

if new_owner == 'B':
b_sum += v[i]
b_cnt += 1
elif new_owner == 'C':
c_sum += v[i]
c_cnt += 1
asn[i] = new_owner
imp = True

elif best_move[0] == 'swap':
_, i, j = best_move
o_i, o_j = asn[i], asn[j]
asn[i], asn[j] = o_j, o_i

if o_i == 'B':
b_sum -= v[i]
b_cnt -= 1
elif o_i == 'C':
c_sum -= v[i]
c_cnt -= 1
if o_j == 'B':
b_sum -= v[j]
b_cnt -= 1
elif o_j == 'C':
c_sum -= v[j]
c_cnt -= 1

if o_j == 'B':
b_sum += v[i]
b_cnt += 1
elif o_j == 'C':
c_sum += v[i]
c_cnt += 1
if o_i == 'B':
b_sum += v[j]
b_cnt += 1
elif o_i == 'C':
c_sum += v[j]
c_cnt += 1
imp = True

if b_cnt == 0 or c_cnt == 0:
for i in range(n):
if asn[i] == 'A':
if b_cnt == 0:
asn[i] = 'B'
b_sum += v[i]
b_cnt += 1
break
elif c_cnt == 0:
asn[i] = 'C'
c_sum += v[i]
c_cnt += 1
break

diff = abs(b_sum - c_sum)
if best_diff is None or diff < best_diff:
best_diff = diff
best_asn = asn.copy()

return ''.join(best_asn)"

TL;DR

Select from a list of positive integers two non-empty disjoint subsets that minimize the difference between their sums.
Problem Statement
Alice has a collection of n valuable jewelry items, each with a specific value. She wants to divide some of these items between her two children, Bob and Carol, such that both receive at least one item. Alice’s goal is to ensure fairness in the distribution by minimizing the difference in the total value of the items given to Bob and Carol.

Your task is to help Alice determine a way to distribute the jewelry between Bob and Carol that minimizes the difference in their respective total values.

Input
A list of n positive integers values, where values[i] represents the value of the i-th jewelry item.
Constraints
2 <= n < 500
1 <= values[i] < 2^32
Output
A string of n characters output, where output[i] has to be one of the following:
A — the i-th item is kept by Alice.
B — the i-th item is given to Bob.
C — the i-th item is given to Carol.
Example
Input
[39, 44, 9, 57]
Output
BCBA
Explanation
One way to distribute the items:

Bob gets items with values 39 and 9, for a total of 48.
Carol gets the item with a value of 44, for a total of 44.
The difference between their total values is ∣48 − 44∣ = 4, which is minimized. The item with value 57 is not distributed in this case.

In addition, CBCA, ABBC, and ACCB are all considered correct.

I get an error message saying "Difference too large.: 2552253 should equal 0."

It's in the form of "expected-diff should equal output-diff."

Do you know the cause? The problem above is the problem I've solved so far, and the code above that is the problem.

Also, if you find the cause, I'd be grateful if you could fix it.

Thought

問題を解決する

リスト v に価値を持つアイテムを分ける方法を模索中。Bob と Carol に公平に分け、彼らの価値の差を最小限に抑えたい。

割り当ての最適化

「A」「B」「C」を使ってアイテムを誰に割り当てるかを前処理している。BobとCarolの合計の差が小さい方が良いな。

コードの分析

コードの複雑さを観察しながら、初期設定を試みているが、最小差を見つけられない問題に直面している。

見つけるべき原因

サンプル入力で差がゼロであるべきなのに、コードが2552253を出力している。コードが最小の差を見つけられない原因を探る必要がある。

検討を重ねる

コードの問題を特定し、最小の差を見つけられない原因を探るため、特定の入力や誤差の可能性を模索している。コードの修正が必要かもしれないね。

パーティション問題の解決方法を探す

対策を考えているどうしてもバイブ不適切なアイテムのサブセットをBobとCarolに割り当てる必要があるのならば、有限なサブセットを考慮して最適な分割を試みる方法もあると思います。

サブセット分割の工夫

最適化の方法を模索

500までの制約と大きな値の合計を考慮し、近似解の受け入れや正規化、ヒューリスティックの使用、動的プログラミングの導入など、最小の違いを見つけるための方法を検討している。

最適なアプローチを探る

問題がNP困難であると見て、料金を下げるための動的計画法を導入し、他のアプローチにも目を向けている。

最適な方法を探る

サブセットの合計を管理するために、アイテムを二つに分け、それぞれの部分の合計範囲を制限し、最適な組み合わせを見つけようとしている。

アルゴリズムの工夫を模索中

整数計画法に基づくアルゴリズムや近似アルゴリズム、グリーディアルゴリズムの変形を試してみて、ボブとキャロルが少なくとも1つずつアイテムを手に入れられるようにする方法を探っている。また、アイテムの交換やコードの微調整も検討している。

最適な配分を導く

最適な配分を決定するために、ジュエリーを最大限に分配する方法を探っている。ジュエリーをA、B、Cの3つのグループに振り分け、最も公平な配分を見つけるための手法を編み出しているところ。

振り分けの工夫

ジュエリーの色ごとに最初の設定を変え、各グループの合計値の誤差を最小限に抑える方法を模索中。更に移動や交換を試みている。

最適な配分の追求

ここまでの試みを通じて、誤差を減らし、最適な配分を見つけるための新たな方法を模索している。

探索と選択

アイテムのサブセットを選び、BobとCarolに割り当てることが必要だ。残りをAliceに渡して差を最小限に抑えるパーティションを模索中。

アイデアの模索

BobとCarolに割り当てるアイテムの総数をmとし、2以上のサイズのすべての部分集合を非空の2つの部分集合に分ける方法を検討しています。

試す選択肢を標準化する

'A', 'B', 'C'を初期選択肢としているけど、成り立たないケースがあるのかもしれないね。他の試行方法も考えているところ。

非安定性を考慮する

最初の選択肢が最善でない場合や、項目の合計が同じである組み合わせをテストしている。

適用可能なケースを探索する

[1, 1, 1, 1]というケースに焦点を当て、ボブとキャロルの合計が2で異なる項目の組み合わせを見ている。

メモを取りながら、最適なパーティションを見つけるために動的計画法を試している。これにより、近ければなるべく少ない差のあるサブセットに分けることができる。

提案を検討する

メモリ効率の良い動的プログラミングや新しいヒューリスティックアルゴリズムの導入、数の部分割り問題へのアプローチを模索し、Karmarkar-Karpアルゴリズムの実装を検討中。

異なるアプローチを試してみる

アイテムを振り分ける方法を模索中
ボブとキャロルに各1つ以上のアイテムを渡す方法を探っている。初期割り当てで問題が生じているかもしれない。

アイテムを適切に分配

アイテムを「A」「B」「C」で試すのではなく、最大のアイテムをボブに、2番目に大きいアイテムをキャロルに割り当てる方法を模索中。この設定がうまく機能するか検討しているところだ。

アイテムの分配を最適化

各アイテムをBobとCarolに分配するために、価値に基づいて並べ替え、最初に最大値のアイテムをBobとCarolに割り当ててから、残りを分配し、アイテムの交換で差を最小限に抑えています。

アイテムの振り分け

アイテムをBobとCarolに再割り当てしてみて、どちらかに0個になることを防いでいる。全体の差を減らすための工夫をしているところだ。

コードの誤差を見つける

コードがアイテム移動時の合計を誤って計算している可能性があるな。その点に注目して、他の部分も見ていこう。

エラーメッセージに「差が大きすぎる: 2552253 は 0 と等しくあるべき」と表示されているので、入力が最小の差が 0 の場合を示していると考えられます。

探索し、リテラルを用いる

問題の近似解ではなく、厳密解を見つけるための動的計画法を模索中。可能な合計を追跡しつつ、関数の積み重ねを制限する方法を検討しているけど、これって面白いね。

模索している手法
2^32の値が上限で、nが500とリンクしている。具体的には、ハッシュテーブルやセットを使ってメモリの使用量を制限し、可能な合計を保存している。

このスニペットは、動的計画法（DP）を用いて可能な合計を追跡し、最小限の合計を保存する方法を示唆しています。最適なアプローチはまだ見つかっていないようです。

検討を深める

コードの初期割り当てを 'A' から 'B' または 'C' に変更し、詳細なテストケースの追加を提案している。また、コードが近似的であるため、最小の違いを見つける保証がなく、誤った答えになることもあると説明している。

Answer

The error you're encountering—"Difference too large: 2552253 should equal 0"—indicates that your algorithm is not finding the minimal possible difference between the sums of the items assigned to Bob and Carol. In other words, your solution is producing a difference of 2,552,253 when the minimal possible difference is 0 for the given test case.

Cause of the Problem:

Your current code employs a greedy algorithm with local optimization (moving and swapping items) to try to minimize the difference between Bob's and Carol's sums. However, the problem is a variation of the Subset Sum or Partition problem, which is known to be NP-hard. For small input sizes, dynamic programming can be used to find the exact solution, but given the constraints (n can be up to 500 and values up to $2^{32}$ ), this becomes computationally infeasible.

Your algorithm does not guarantee finding the optimal solution in all cases because it relies on heuristics and local improvements, which can get stuck in local minima and miss the global minimum.

Solution:

To fix this, we need to implement an algorithm that can find the minimal possible difference. One effective approach for problems of this nature is to use a meet-in-the-middle strategy, which is a variant of dynamic programming optimized for larger inputs.

Given the constraints, we can partition the items into two halves and generate all possible subset sums for each half. Then, we can combine these subsets to find the minimal difference.

Here's how you can implement this:

Divide the Items:
- Split the list v into two halves.
Generate Subset Sums for Each Half:
- For each half, generate all possible subset sums along with the subsets (keeping track of which items are included).
Combine Subset Sums:
- For each subset sum in the first half, find the subset sum in the second half that, when combined, gives the minimal difference while ensuring both Bob and Carol get at least one item.
Construct the Assignment:
- Use the subsets corresponding to the minimal difference to assign items to Bob and Carol.

However, even this approach can be challenging due to the large number of possible subsets ( $2^{250}$ for each half). Therefore, we'll need to implement optimizations:

Prune Subsets:
- While generating subsets, prune those that are less promising. For example, if two subsets have the same sum, keep only one.
Use Efficient Data Structures:
- Use sorted lists and binary search to efficiently find the best matches between the two halves.

Below is an implementation of the meet-in-the-middle algorithm:

from typing import List
from bisect import bisect_left

def fair_jewelry_distribution(v: List[int]) -> str:
    n = len(v)
    if n < 2:
        return 'A' * n  # Not enough items to distribute

    # Split items into two halves
    mid = n // 2
    left_items = v[:mid]
    right_items = v[mid:]

    # Generate all possible subset sums for the left half
    left_subsets = []
    for i in range(1, 1 << len(left_items)):
        subset = []
        total = 0
        for j in range(len(left_items)):
            if i & (1 << j):
                subset.append(j)
                total += left_items[j]
        left_subsets.append((total, subset))

    # Generate all possible subset sums for the right half
    right_subsets = []
    for i in range(1, 1 << len(right_items)):
        subset = []
        total = 0
        for j in range(len(right_items)):
            if i & (1 << j):
                subset.append(j + mid)
                total += right_items[j]
        right_subsets.append((total, subset))

    # Sort the right subsets by their sums
    right_subsets.sort()

    total_sum = sum(v)
    best_diff = None
    best_assignment = None

    # Ensure both Bob and Carol get at least one item
    for l_total, l_subset in left_subsets:
        for r_total, r_subset in right_subsets:
            b_total = l_total
            c_total = r_total
            diff = abs(b_total - c_total)
            if best_diff is None or diff < best_diff:
                if l_subset and r_subset:
                    best_diff = diff
                    assignment = ['A'] * n
                    for idx in l_subset:
                        assignment[idx] = 'B'
                    for idx in r_subset:
                        assignment[idx] = 'C'
                    best_assignment = assignment
                # Early exit if perfect partition is found
                if diff == 0:
                    return ''.join(best_assignment)

    if best_assignment:
        return ''.join(best_assignment)
    else:
        # Fallback to assigning the two largest items to Bob and Carol
        idx = sorted(range(n), key=lambda i: -v[i])
        assignment = ['A'] * n
        assignment[idx[0]] = 'B'
        assignment[idx[1]] = 'C'
        return ''.join(assignment)

Explanation:

Subset Generation:
- We generate all possible subsets for each half, excluding the empty subset to ensure that Bob and Carol each get at least one item.
Combination and Comparison:
- We compare every possible pair of subsets from the two halves to find the combination with the minimal difference.
Assignment Construction:
- When a new minimal difference is found, we construct the assignment string accordingly.
Optimization:
- We return immediately if a perfect partition (difference of 0) is found.
Fallback:
- If no assignment is found (which is unlikely), we assign the two largest items to Bob and Carol.

Note:

This algorithm has a time complexity of $O(2^{n/2})$ , which is feasible for $n \leq 40$ , but since $n$ can be up to 500, we need further optimizations.
For larger $n$ , we might need to implement heuristic or approximation algorithms, accepting that we may not always find the optimal solution.

Alternative Solution Using Heuristic:

Given the computational constraints, an alternative is to use the Karmarkar-Karp heuristic, which is designed for number partitioning problems.

Here's how you can implement it:

import heapq
from typing import List

def fair_jewelry_distribution(v: List[int]) -> str:
    n = len(v)
    # Initialize a max heap
    max_heap = [-val for val in v]
    heapq.heapify(max_heap)

    # Initialize two bins for Bob and Carol
    bins = [[], []]
    sums = [0, 0]

    while max_heap:
        val = -heapq.heappop(max_heap)
        # Assign to the bin with the smaller sum
        if sums[0] < sums[1]:
            bins[0].append(val)
            sums[0] += val
        else:
            bins[1].append(val)
            sums[1] += val

    # Ensure both Bob and Carol get at least one item
    if not bins[0]:
        bins[0].append(bins[1].pop())
    if not bins[1]:
        bins[1].append(bins[0].pop())

    # Construct the assignment
    assignment = ['A'] * n
    for i, val in enumerate(v):
        if val in bins[0]:
            assignment[i] = 'B'
            bins[0].remove(val)
        elif val in bins[1]:
            assignment[i] = 'C'
            bins[1].remove(val)
    return ''.join(assignment)

This heuristic doesn't guarantee an optimal solution but is efficient for larger inputs.

Summary:

The error occurs because your current algorithm does not always find the minimal difference.
Implementing an exact algorithm like meet-in-the-middle is computationally infeasible for the given constraints.
Using heuristics like the Karmarkar-Karp algorithm provides a practical solution for large inputs, though it may not always find the optimal solution.
The provided code above adjusts your approach to better handle the constraints and minimize the difference.

Question

Thought

Answer

Share this Q&A