To improve the performance of bcp:
Use fast-logged bcp.
Use partitioned tables – running several bcp sessions against a partitioned table can dramatically reduce the time required to copy the data. The improvement is more noticeable with fast and fast-logged bcp than with slow bcp.
Run bcp in parallel – parallel bulk copy can increase performance dramatically and can provide balanced data distribution across partitions. For more information, see “Using Parallel Bulk Copy to Copy Data into a Specific Partition”.
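As a rough illustration of the partitioned, parallel approach described above, the following sketch prints one bcp command per partition. It is a dry run only: it does not invoke bcp. The table name (pubs2..titles), the partition names (p1, p2, p3), the data file names, the login (sa), and the server name (SYBASE) are all hypothetical placeholders, and the "partition partition_name" form should be checked against the bcp syntax for your server version.

```shell
#!/bin/sh
# Dry-run sketch: build one bcp command line per partition.
# Every name here is an assumption made for illustration:
#   pubs2..titles  - target table (hypothetical)
#   p1 p2 p3       - partition names (hypothetical)
#   pN.dat         - one input data file per partition (hypothetical)
#   sa / SYBASE    - login and server name (hypothetical)

TABLE="pubs2..titles"
PARTITIONS="p1 p2 p3"

# Print the bcp command that would load one partition from its own file.
# -c selects character format; -U and -S give the login and server.
bcp_cmd() {
    part="$1"
    printf 'bcp %s partition %s in %s.dat -c -U sa -S SYBASE\n' \
        "$TABLE" "$part" "$part"
}

# Emit the commands instead of executing them.
for part in $PARTITIONS; do
    bcp_cmd "$part"
done
```

To actually load the data, each printed command would be started as a separate session (for example, as a background job followed by wait), with credentials supplied in a way that does not require an interactive password prompt.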
bcp does not fire any trigger that exists on the target table.