A parallel algorithm for the general LU factorization