How to use bigcode/santacoder-fast-inference with Transformers:
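# Use a pipeline as a high-level helper
# SantaCoder is a decoder-only code model, so the task is "text-generation" rather than "fill-mask"
from transformers import pipeline

pipe = pipeline("text-generation", model="bigcode/santacoder-fast-inference")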
# Load model directly
# AutoModelWithLMHead is deprecated; AutoModelForCausalLM is its replacement for causal language models
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("bigcode/santacoder-fast-inference")
model = AutoModelForCausalLM.from_pretrained("bigcode/santacoder-fast-inference")
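Once the tokenizer and model are loaded, generating a completion could look roughly like the sketch below. It assumes the checkpoint loads as a causal language model through `AutoModelForCausalLM` and that PyTorch is installed. The helper names `complete` and `run_demo`, the example prompt, and the `max_new_tokens` value are illustrative choices, not part of the model card.

```python
def complete(prompt, tokenizer, model, max_new_tokens=32):
    """Return `prompt` plus a greedy continuation produced by `model`.

    `tokenizer` and `model` are expected to behave like the objects returned
    by AutoTokenizer / AutoModelForCausalLM (hypothetical helper, for illustration).
    """
    # Encode the prompt into a batch of token ids (a PyTorch tensor).
    input_ids = tokenizer.encode(prompt, return_tensors="pt")
    # Greedy decoding: do_sample=False picks the most likely token at each step.
    output_ids = model.generate(
        input_ids, max_new_tokens=max_new_tokens, do_sample=False
    )
    # generate() returns a batch; decode the first (and only) sequence.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


def run_demo():
    """Download the checkpoint and print one completion (needs network access)."""
    # Imported lazily so that defining the helpers does not require transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    checkpoint = "bigcode/santacoder-fast-inference"
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForCausalLM.from_pretrained(checkpoint)
    print(complete("def print_hello_world():", tokenizer, model))
```

Calling `run_demo()` downloads the weights on first use, so it is left uncalled here. Greedy decoding is used only to keep the output deterministic; sampling parameters can be passed to `generate` instead.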