Calling the same Rcpp function multiple times returns different results

I wrote a parallel implementation of group sums using RcppParallel.

// [[Rcpp::depends(RcppParallel)]]
#include <Rcpp.h>
#include <RcppParallel.h>
using namespace Rcpp;
using namespace RcppParallel;

struct SumsG: public Worker
{
  const RVector<double> v;
  const RVector<int> gi;

  RVector<double> sg;

  SumsG(const NumericVector v, const IntegerVector gi, NumericVector sg): v(v), gi(gi), sg(sg) {}
  SumsG(const SumsG& p, Split): v(p.v), gi(p.gi), sg(p.sg) {}

  void operator()(std::size_t begin, std::size_t end) {
    for (std::size_t i = begin; i < end; i++) {
      sg[gi[i]] += v[i];
    }
  }

  void join(const SumsG& p) {
    for(std::size_t i = 0; i < sg.length(); i++) {
      sg[i] += p.sg[i];
    }
  }
};

// [[Rcpp::export]]
List sumsingroups(NumericVector v, IntegerVector gi, int ni) {
  NumericVector sg(ni);
  SumsG p(v, gi, sg);
  parallelReduce(0, v.length(), p);

  return List::create(_["sg"] = p.sg);
}

      

Compiled with Rcpp::sourceCpp

. Now when I call it from R sumsingroups(1:10, rep(0:1, each = 5), 2)

multiple times, I get the correct answer ( 15 40

) and then something else (usually some multiplicative from the correct answer). Running

res <- sumsingroups(1:10, rep(0:1, each = 5), 2)
for(i in 1:1000) {
    tmp <- sumsingroups(1:10, rep(0:1, each = 5), 2)
    if(res[[1]][1] != tmp[[1]][1]) break
    Sys.sleep(0.1)
}

      

aborts a random iteration returning

$sg
[1]  60 160

      

or

$sg
[1] 30 80

      

I am new to Rcpp

and RcppParallel

and have no idea what might be causing this behavior.


Refresh. Things That Didn't Help:

  • Added for (std::size_t i = 0; i < sg.length(); i++) sg[i] = 0;

    to both constructors.
  • Changed names to be different in Worker

    and in the implementation of the function.
+3


source to share


1 answer


Try it.

#include <Rcpp.h>
using namespace Rcpp;
// [[Rcpp::depends(RcppParallel)]]
#include <RcppParallel.h>

using namespace RcppParallel;
struct SumsInGroups5: public Worker
{
    const RVector<double> v;
    const RVector<int> g;

    std::vector<double> s;

    SumsInGroups5(const NumericVector v, const IntegerVector g): v(v), g(g),  s(*std::max_element(g.begin(), g.end()) + 1, 0.0){ }

    SumsInGroups5(const SumsInGroups5& p, Split): v(p.v), g(p.g), s(*std::max_element(g.begin(), g.end()) + 1, 0.0) {}

    void operator()(std::size_t begin, std::size_t end) {
        for (std::size_t i = begin; i < end; ++i) {
            s[g[i]]+=v[i];
        }

    }

    void join(const SumsInGroups5& rhs) {
        for(std::size_t i = 0; i < s.size(); i++) {
            s[i] += rhs.s[i];
        }
    }
};

// [[Rcpp::export]]
NumericVector sg5(NumericVector v, IntegerVector g) {
    SumsInGroups5 p(v, g);
    parallelReduce(0, v.length(), p);
    return wrap(p.s);
}

/*** R
a <- 1:10
g <- c(rep(0,5),rep(1,5))


bb <- lapply(1:10000,function(x)sg5(a,g))
cc<-do.call("rbind",bb)
unique(cc)
*/

      



Compared to my other attempts, this code did not produce strange results in the same cases as other code. Not really sure.

+1


source







All Articles