Count the number of ways to select two numbers in an efficient algorithm

Question

Count the number of ways to select two numbers in an efficient algorithm

I solved this problem, but I have TLE Time limit exceeded in online judge

the output of the program is right, but I think the way can be improved to be more efficient!

problem:

Considering n

integers, count the number of ways in which we can select two elements: that their absolute difference is less 32

.

In a more formal way, count the number of pairs (i, j) (1 ≤ i < j ≤ n)

such that

|V[i] - V[j]| < 32

... |X|

is an absolute value X

.

Input

The first line of input contains one integer T

, the number of test cases (1 ≤ T ≤ 128)

.

Each test case starts with an integer n (1 ≤ n ≤ 10,000)

.

The next line contains n

integers (1 ≤ V[i] ≤ 10,000)

.

Output

For each test case, print the number of pairs on one line.

my code in C ++:

int main() {
    int T,n,i,j,k,count;
    int a[10000];
    cin>>T;

for(k=0;k<T;k++)
 {   count=0;
     cin>>n;
    for(i=0;i<n;i++)
    {
      cin>>a[i];
    }
    for(i=0;i<n;i++)
    {
        for(j=i;j<n;j++)
        {
          if(i!=j)
          {
            if(abs(a[i]-a[j])<32)
                count++;
          }
        }
    }
    cout<<count<<endl;
 }
    return 0;
}

I need help, how can I solve it in a more efficient algorithm?

+3

c ++ algorithm

Oghli Jul 29. 15 at 13:43

source to share

10 answers

#include <bits/stdc++.h> 
using namespace std;

int a[10010];
int N;
int search (int x){
int low = 0;
int high = N;
while (low < high)
{
    int mid = (low+high)/2;
    if (a[mid] >= x) high = mid;
    else low = mid+1;
}
return low;
}

int main() {
    cin >> N;
    for (int i=0 ; i<N ; i++) cin >> a[i];
    sort(a,a+N);
    long long ans = 0;
    for (int i=0 ; i<N ; i++)
    {
        int t = search(a[i]+32);
        ans += (t -i - 1);
    }

    cout << ans << endl;
    return 0;
}

+3

Abdullah Abdelmonem 15 Sep At 22:10

source to share

You can sort the numbers and then use a sliding window. Starting with the smallest number, fill in the std::deque

digits until they exceed the smallest number + 31. Then, in the outer loop, for each number, update the sliding window and add a new sliding window size to the pillar. Updating the sliding window can be performed in the inner loop, first pop_front

every number that is less than the current number of the outer loop, and then push_back

every number that is not more than the current number of the outer loop + 31.

+2

Serge rogatch Jul 29. 15 at 14:02

source to share

One quicker solution would be to sort the array first, then iterate over the sorted array, and for each element, only visit the elements to its right until the difference exceeds 31.

The sort can probably be done by sorting count (since you have 1 ≤ V [i] ≤ 10,000). This way you get linear time for the sorting part. It might not be necessary though (maybe for a quick point of view it's enough to get all the points).

Alternatively, you can do a trick for the inner loop (the "right of the current element" part). Note that if S [i + k] -S [i] 32 then S [i + k] -S [i + 1] <32, where S is the sorted version of V. With this trick, the whole algorithm becomes linear.

+1

cobarzan Jul 29. 15 at 14:03

source to share

This can be done with a constant number of passes over the data and can in fact be done without compromising the "spacing" value (32 in your case). This is done by filling in an array, where a[i] = a[i-1] + number_of_times_i_appears_in_the_data

- informally, a[i]

contains the total number of elements that are less than / equal to i

.

Code (for one test case):

static int UPPER_LIMIT = 10001;
static int K = 32;
int frequencies[UPPER_LIMIT] = {0}; // O(U)
int n;
std::cin >> n;
for (int i = 0; i < n; i++) { // O(n)
 int x;
 std::cin >> x;
 frequencies[x] += 1;
}
for (int i = 1; i < UPPER_LIMIT; i++) { // O(U)
    frequencies[i] += frequencies[i-1];
}
int count = 0;
for (int i = 1; i < UPPER_LIMIT; i++) { // O(U)
  int low_idx = std::max(i-32, 0);
  int number_of_elements_with_value_i = frequencies[i] - frequencies[i-1];
  if (number_of_elements_with_value_i == 0) continue;
  int number_of_elements_with_value_K_close_to_i =
      (frequencies[i-1] - frequencies[low_idx]);
  std::cout << "i: " << i << " number_of_elements_with_value_i: " << number_of_elements_with_value_i << " number_of_elements_with_value_K_close_to_i: " << number_of_elements_with_value_K_close_to_i << std::endl;
  count += number_of_elements_with_value_i * number_of_elements_with_value_K_close_to_i;
  // Finally, add "duplicates" of i, this is basically sum of arithmetic 
  // progression with d=1, a0=0, n=number_of_elements_with_value_i
  count += number_of_elements_with_value_i * (number_of_elements_with_value_i-1) /2;
}
std::cout << count;

Complete working example on IDEone .

+1

amit Apr 10 17 at 8:56

source to share

This solution can be thought of as O (N) to process N input numbers and a constant over time to process input:

#include <iostream>
using namespace std;

void solve()
{
    int a[10001] = {0}, N, n, X32 = 0, ret = 0;
    cin >> N;
    for (int i=0; i<N; ++i)
    {
        cin >> n;
        a[n]++;
    }

    for (int i=0; i<10001; ++i)
    {
        if (i >= 32)
            X32 -= a[i-32];
        if (a[i])
        {
            ret += a[i] * X32;
            ret += a[i] * (a[i]-1)/2;
            X32 += a[i];
        }
    }
    cout << ret << endl;
}

int main()
{
    int T;
    cin >> T;
    for (int i=0 ; i<T ; i++)
        solve();
}

run this code on ideone

Explanation of solution: a[i]

shows how many times i

was in the input rows. Then you loop through the entire array and X32 keeps track of the number of elements that range from i

. The only tricky part is to calculate correctly when multiple I repeated several times a[i] * (a[i]-1)/2

. What is it.

+1

Pavel Apr 11 17 at 10:36

source to share

You can sort and then use a tear-off loop to the end when the range deviates.

int main()
{
    int t;
    cin>>t;

    while(t--){
        int n,c=0;
        cin>>n;

        int ar[n];
        for(int i=0;i<n;i++)
            cin>>ar[i];

        sort(ar,ar+n);

        for(int i=0;i<n;i++){
            for(int j=i+1;j<n;j++){
                if(ar[j]-ar[i] < 32)
                    c++;
                else
                    break;
            }
        }
        cout<<c<<endl;
    }
}

Or you can use a hash array for the range and mark the occurrence of each element and then loop around and check for each element, i.e. if x = 32-y is present or not.

+1

Harsh Khatore Apr 15. 17 at 5:02

source to share

A good approach here is to split the numbers into separate buckets:

constexpr int limit = 10000;
constexpr int diff = 32;
constexpr int bucket_num = (limit/diff)+1;
std::array<std::vector<int>,bucket_num> buckets;

cin>>n;
int number;
for(i=0;i<n;i++)
{
    cin >> number;
    buckets[number/diff].push_back(number%diff);
}

Obviously, the numbers in the same bucket are close enough to each other to meet the requirement, so we can just count all the pairs:

int result = std::accumulate(buckets.begin(), buckets.end(), 0,
                  [](int s, vector<int>& v){ return s + (v.size()*(v.size()-1))/2; });

Numbers that are in non-contiguous buckets cannot form acceptable pairs, so we can simply ignore them.

This leaves the last corner case - adjacent buckets - which can be solved in different ways:

for(int i=0;i<bucket_num-1;i++)
   if(buckets[i].size() && buckets[i+1].size())
      result += adjacent_buckets(buckets[i], buckets[i+1]);

Personally, I like the "spawn rate" approach at the scale of a single cabinet, but there may be better options:

int adjacent_buckets(const vector<int>& bucket1, const vector<int>& bucket2)
{
    std::array<int,diff> pairs{};
    for(int number : bucket1)
     {
         for(int i=0;i<number;i++)
            pairs[i]++;
     }

    return std::accumulate(bucket2.begin(), bucket2.end(), 0,
                           [&pairs](int s, int n){ return s + pairs[n]; }); 
}

This function first creates an array of "numbers from the bottom bucket that are close enough to i

" and then sums the values from that array that correspond to the top numbers of the bucket.

In general, this approach is O (N) in complexity, at best it only takes one pass, and in general should be fast enough.

An example of a working ideal

+1

Ap31 Apr 16 17 at 20:23

source to share

You have to start by sorting the input.

Then, if your inner loop detects that the distance is growing above 32, you can drop it.

0

Stef Jul 29. At 13:54

source to share

Thanks for all the time and effort to resolve this issue.

I appreciated all attempts to solve it.

After testing the answers on the online judge, I found the correct and most efficient algorithm for solving Stef Answer and AbdullahAhmedAbdelmonem's answer also pavel's solution is correct, but it is exactly the same as Stef's solution in another C ++ language.

Stef received code at run time 358 ms in online Codeforces judge and adopted.

also code AbdullahAhmedAbdelmonem got run time 421 ms in the online judge codeforces and adopted.

if they explain the algorithm in detail, then the bounty will be for one of them.

you can try your solution and send it to Codeforces online judge using this link after choosing problem E. Time limit exceeded?

also i found a great algorithmic solution and clearer use of frequency array, as well as O (n) complexity.

in this algorithm, you only need to take a specific range for each inserted element into the array, which is:

begin = element - 32  
end = element + 32

and then count the number of pairs in that range for each inserted element in the frequency array:

int main() {
    int T,n,i,j,k,b,e,count;
    int v[10000];
    int freq[10001];

    cin>>T;   
 for(k=0;k<T;k++)
 { 
    count=0;
    cin>>n;
    for(i=1;i<=10000;i++)
    {
      freq[i]=0;
    }
    for(i=0;i<n;i++)
    {
     cin>>v[i];
    }   
    for(i=0;i<n;i++)
    {
     count=count+freq[v[i]];

     b=v[i]-31;
     e=v[i]+31;

     if(b<=0)
         b=1;
     if(e>10000)
        e=10000;

     for(j=b;j<=e;j++)
      {
        freq[j]++;
      }
    }
    cout<<count<<endl;
 }
    return 0;
}

Finally, I think the best approach to solving problems like this is to use a frequency array and count the number of pairs in a specific range, since the time complexity is O (n).

0

Oghli Apr 17 17 at 9:38 am

source to share

Stef · Accepted Answer · 2015-07-29T14:42:20+0000

Despite my previous (stupid) answer, there is no need to sort the data at all. Instead, you should count the frequencies of the numbers.

Then all you have to do is keep track of the number of viable numbers to pair with, iterate over the possible values. Sorry no C ++, but java must be readable as well:

int solve (int[] numbers) {                                                 
    int[] frequencies = new int[10001];                                     
    for (int i : numbers) frequencies[i]++;                                 
    int solution = 0;                                                       
    int inRange = 0;                                                        
    for (int i = 0; i < frequencies.length; i++) {                          
        if (i > 32) inRange -= frequencies[i - 32];                         
        solution += frequencies[i] * inRange;                               
        solution += frequencies[i] * (frequencies[i] - 1) / 2;              
        inRange += frequencies[i];                                           
    }                                                                        
    return solution;                                                         
}

Count the number of ways to select two numbers in an efficient algorithm

More articles: