There are three ways to improve bcp performance:
Fast-logged bcp
Partitioned tables – running several bcp sessions against a partitioned table can dramatically reduce the time required to copy the data. The improvement is more noticeable with fast and fast-logged bcp than with slow bcp.
Parallel bcp – running bcp sessions in parallel can increase performance dramatically, and parallel bulk copy can provide balanced data distribution across partitions.
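As a rough illustration of combining partitioned tables with parallel sessions, the sketch below splits input rows evenly into one chunk per partition and builds one bcp command line per chunk. It is a dry run only: it prints the commands rather than executing them. The helper names, the table name, the server and login values, and the chunk file names are all hypothetical. The `table:slice_number` form and the `-c`, `-S`, and `-U` options are assumed to match your bcp version, so check them against your utility reference before use.

```python
import shlex


def split_round_robin(lines, n_parts):
    """Distribute rows across n_parts chunks in round-robin order,
    so every chunk ends up with roughly the same number of rows."""
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    chunks = [[] for _ in range(n_parts)]
    for i, line in enumerate(lines):
        chunks[i % n_parts].append(line)
    return chunks


def build_bcp_commands(table, chunk_files, server, user):
    """Build one bcp argument list per chunk file.

    Assumption: partition (slice) numbers start at 1 and are addressed
    as table:slice_number. No password option is included, so bcp is
    expected to prompt for it.
    """
    commands = []
    for slice_no, path in enumerate(chunk_files, start=1):
        commands.append([
            "bcp", f"{table}:{slice_no}", "in", str(path),
            "-c",            # character-format data file (assumed)
            "-S", server,    # hypothetical server name
            "-U", user,      # hypothetical login
        ])
    return commands


if __name__ == "__main__":
    # Hypothetical rows and file names, for illustration only.
    rows = [f"row{i}" for i in range(7)]
    for n, chunk in enumerate(split_round_robin(rows, 3), start=1):
        print(f"chunk {n}: {len(chunk)} rows")
    files = ["chunk1.dat", "chunk2.dat", "chunk3.dat"]
    for cmd in build_bcp_commands("pubs2..titles", files, "MYSERVER", "sa"):
        print(shlex.join(cmd))
```

Each printed command would be started as its own session, one per partition. Round-robin splitting is just one simple way to keep the chunks the same size; splitting the file into contiguous ranges works equally well if row order within a partition matters.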